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Abstract 



The paper gives a systematic review of the basic ideas of (non-relativistic) quantum 
mechanics including all improvements that result from previous work of the authors. 
The aim is to show that the new theory is self-consistent and that it reproduces all 
measurable predictions of standard quantum mechanics. The most important im- 
provements are: 1) A new realist interpretation of quantum mechanics assumes that 
all properties that are uniquely determined by preparations can be viewed as objec- 
tive. These properties are then sufficient to justify the notion of quantum object. 
2) Classical systems are defined as macroscopic quantum objects in states close to 
maximum entropy. For classical mechanics, new states of such kind are introduced, 
the so-called maximum-entropy packets, and shown to approximate classical dynam- 
ics better than Gaussian wave packets. 3) A new solution of quantum measurement 
problem is proposed for measurements that are performed on microsystems. First, 
it is assumed that readings of registration apparatuses are always signals from detec- 
tors. Second, an application of the cluster separability principle leads to a locality 
requirement on observables and to the key notion of separation status. Separation 
status of a microsystem is shown to change in preparation and registration processes. 
This gives preparation and registration new meaning and enhances the importance of 
these notions. Changes of separation status are alterations of kinematic description 
rather than some parts of dynamical trajectories and thus more radical than 'collapse 
of the wave function'. Standard quantum mechanics does not provide any informa- 
tion of how separation status changes run, hence new rules must be formulated. As 
an example of such a new rule, Beltrametti-Cassinelli-Lahti model of measurement 
is modified and shown then to satisfy both the probability-reproducibility and the 
objectification requirements. 
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0.1 Introduction 



Since its beginnings, quantum mechanics has been plagued by two often discussed 
paradoxes. The first one, let us call it 'non-objectivity-of-the measurable', is that 
the values obtained by most measurements on microsystems cannot be assumed to 
exist before the measurements, that is, to be objective properties of the microsystem 
on which the measurement is done. An assumption of this kind of objectivity would 
lead to contradictions under various circumstances (contextuality [71 HQ], Bell in- 
equalities [I], Hardy impossibilities [33], Greenberger-Horne-Zeilinger equality [27]). 
This seduces people into thinking that quantum systems are 'elusive' [56J or 'not 
physical objects' [38] . 

The second one, let us call it 'Schrodinger-cat' paradox, is that application of 
quantum mechanics to interactions between microscopic and macroscopic systems 
leads to results that contradict experience (if artificial augmentations of the theory 
such as 'collapse of the wave function' are forbidden). Indeed, the result often 
predicted by quantum mechanics for such an interaction is that the macroscopic 
system evolves into a linear superposition of macroscopically different states (dead 
and alive cat). This also underlies the problem of quantum measurement: the state 
of a macroscopic apparatus after a measurement as predicted by quantum mechanics 
does not satisfy the objectification requirement [16] . According to the requirement, a 
well-defined value can be read off the apparatus after each individual measurement. 

There is, however, another paradox that has as yet been left rather in shadow. 
Let us call it 'double-ruling' paradox: quantum mechanics has one set of rules for 
dealing with a microscopic system as if it were alone in the world (an isolated system) 
and another set of rules for it as a subsystem of a family of microsystems that are 
identical to it. To avoid contradictions resulting from the double ruling, the idea 
of cluster separability [56J has been invented. However, only short and superficial 
discussion of the idea within (non-relativistic) quantum mechanics seems to exist. 

The present paper deals with the paradoxes. The non-objectivity-of-the- measur- 
able is a fact of life but it itself only implies that some properties are not objective. 
Under certain conditions, microsystems have sufficient number of other properties 
that can be viewed as objective without contradictions [2H]. Under these conditions, 
the microsystems can be considered as physical objects. This also helps to explain 
the emergence of classical properties [30J. The double- ruling and the Schrodinger- 
cat paradoxes are shown to be related. They can both be solved if the nature of 
cluster separability is clarified [3"Tl I3"2] . The meaning and the role of preparations 
and registrations appear then in a new light. In any case, the quantum mechanics 
emerging on the basis of our ideas is at many points different from the usual textbook 
version. This paper gives a short but systematic account of it in order to show that 
it is a coherent theory free from internal contradictions. 
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Any physical theory can be divided into two parts. First, there are basic notions 
and rules that form a kind of language. The language is general so that it is, on the 
one hand, very broadly applicable, and, on the other, unable to make any specific 
numerical predictions. Thus, it is difficult — may be even impossible — to disprove 
by some observational facts. Second, there are models formulated with the help 
of the language that are to explain and predict properties of observed systems. 
Each model introduces assumptions about the structure of the system and values 
of various parameters so that quantitative results can be obtained. These can be 
disproved. For example, in Newtonian mechanics, the notion of state as a point in 
phase space, of forces, the law of gravitation and Newton's dynamical equations can 
form the language. To describe the solar system, assumptions such as the number 
of bodies, their masses and certain class of their trajectories define a model. In the 
present paper, we attempt to separate quantum mechanics into the language and 
the model parts. 

The idea that the main activity of theoreticians is to construct models of observed 
phenomena and that the general rules and notions must be adapted to this aim plays 
an important role in our approach. For example, many textbooks (see, e.g., Ref. 
[56J) maintain that any mathematically existing state of any quantum system can be 
prepared by some physical device, and that any mathematically existing observable 
of it can be registered by some physical device, without specifying the devices. 
The fact that such claims seem to be false for most macroscopic systems leads 
some authors (see, e.g., [48j|49]) to abandon the universality of quantum mechanics. 
However, quantum mechanics would work perfectly well without any of the two 
assumptions. On the one hand, one is free to use any mathematically existing 
state or observable for a model construction and for prediction of its properties. 
Mostly, only few are needed. On the other, it will turn out (see Sees. 2.2.2 and 
6.2) that quantum analysis of the model assumptions itself implies that some of the 
mathematical observables cannot be registered. 

0.1.1 Examples of quantum systems 

To explain what is quantum mechanics about, this section mentions some known 
quantum systems. Thus, it belongs mainly to the model part of theory. However, it 
also introduces some general notions, such as microsystem, macrosystem, structural 
property or type of system. 

Quantum mechanics is a theory that describes certain class of properties of certain 
class of systems in a similar way as any other physical theory does. For example, 
Newtonian mechanics describes bodies that can be considered as rigid in a good 
approximation and studies the motion of the bodies (we consider 'mass point' as an 
idealisation of a small rigid body). 
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Quantum systems that we shall consider are photongj, electrons, neutrons and 
nuclei, which we call particles, and systems containing some number of particles, 
such as atoms, molecules and macroscopic systems, which are called composite. 
Of course, neutrons and nuclei themselves are composite of quarks and gluons, but 
non-relativistic quantum mechanics can and must start from some phenomenological 
description of neutrons and nuclei. We just view them as structureless. 

Let us call particles and quantum systems that are composite of small number of 
particles microsystems. They are extremely tiny and they can never be perceived 
directly by our senses. We can observe directly only macroscopic quantum sys- 
tems that are composed of very many particles (except for photons: the eye can 
recognize signals of several photons). 'Very many' is: not too different from 10 23 
(the Avogadro number). Let us call these macro systems. Some properties of most 
macroscopic systems obey classical theories. For example, shape and position of my 
chair belong to Euclidean geometry, chemical composition of its parts to classical 
chemistry and thermodynamic properties of the parts such as phase or temperature 
to phenomenological thermodynamics. Such properties are called 'classical'. Thus, 
properties of microsystems can only be observed via classical properties of macrosys- 
tems, if microsystems interact with them and this interaction leaves classical traces. 

Microsystems are divided into types, such as electrons or hydrogen atoms. Sys- 
tems of one type are not distinguishable from each other in a sense not existing 
in classical physics. Systems of the same type are often called identical. Microsys- 
tems exist always in a huge number of identical copies. The two properties of 
microsystems, viz. 1) their inaccessibility to direct observations and 2) utter lack 
of individuality that is connected with the existence of a huge number of identical 
copies, make them rather different from classical systems or 'things'. Each classical 
system can be observed directly by humans (in principle: for example, the distant 
galaxies) and each can be labelled and distinguished from other classical systems, 
because it is a quantum system composite of a huge number of particles and hence 
it is highly improbable that it has a relative of the same type in the world. 

Objective properties that are common to all microsystems of the same type will 
be called structural. Thus, each particle has a mass, spin and electric charge. These 
are parameters that must be chosen in constructing models of existing systems 
taking into account their empirical values; they are not provided by the language 
part of theory. For example, each electron has a mass about .5 MeV, a spin 1/2 
and a charge about 10 -19 C. Any composite system consists of definite numbers 
of different particles with their masses, spins and charges. E.g., a hydrogen atom 

^^One can ask the question whether there is a non-relativistic limit of photons. In one such 
limit, photons may move with infinite velocity and their position need not therefore be very well 
defined. In another, photons may be represented by a classical electromagnetic wave. 
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contains one electron and one proton (nucleus). The composition of a system is 
another structural property. The structural properties influence the dynamics of 
quantum systems; the way they do it and what dynamics is will be explained later. 
Only then, it will be clear what the meaning of these parameters is and that the 
type of each system can be recognized, if its dynamics is observed. When we shall 
know more about dynamics, further structural properties will emerge. 

Structural properties are objective in the sense that they can be assumed to exist 
before and independently of any measurement. Such assumption does not lead to 
any contradictions with standard quantum mechanics and is at least tacitly made 
by most physicists. In fact, the ultimate goal of practically all experiments is to find 
structural properties of quantum systems. 

From the formally logical point of view, all possible objective properties of given 
kind of objects ought to form a Boolean lattice. The structural properties satisfy 
this condition: systems with a given structural property form a subset of all sys- 
tems. These subsets are always composed of whole type-classes of quantum systems. 
Clearly, the intersection of two such subsets and the complement of any such subset 
is again a structural property. 

0.1.2 Examples of quantum experiments 

The topic of this section plays an important role in understanding quantum me- 
chanics. Specific examples of typical experiments will be given in some detail. That 
belongs to the model part of theory, but we gain access to some general notions of 
the language part such as preparation and registration. The concept of individual 
quantum object will be introduced and its relation to preparation and registration 
mentioned. 

Let us first consider experiments with microsystems that are carried out in lab- 
oratories. Such an experiment starts at a source of microsystems that are to be 
studied. Let us give examples of sources for each microsystem. 

1. Electrons. One possible source (called field emission, see e.g. Ref. [52], p. 38) 
consists of a cold cathode in the form of a sharp tip and a flat anode with an 
aperture in the middle at some distance from the cathode, in a vacuum tube. 
The electrostatic field of, say, few kV will enable electrons to tunnel from the 
metal and form an electron beam of about 10 7 electrons per second through 
the aperture, with a relatively well-defined energy. 

2. Neutrons can be obtained through nuclear reaction. This can be initiated by 
charged particles or gamma rays that can be furnished by an accelerator or 
a radioactive substance. For example the so-called Ra-Be source consists of 
finely divided RaCl2 mixed with powdered Be, contained in a small capsule. 
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The reaction is produced by the alpha particle of Ra. The yield for 1 mg Ra is 
about 10 4 neutrons per second with broad energy spectrum from small energies 
to about 13 MeV. The emission of neutrons is roughly spherically symmetric 
centred at the capsule. 

3. Atoms and molecules. A macroscopic specimen of the required substance in 
gaseous phase at certain temperature can be produced, e.g. by an oven. The 
gas is in a vessel with an aperture from which a beam of the atoms or molecules 
emerges. 

One can easily observe that each source is completely determined by an arrangement 
of macroscopic bodies of different shapes, chemical compositions and temperatures, 
and that the electric or magnetic fields are determined by their macroscopic charac- 
teristics, such as field intensities: that is, by their classical properties. These prop- 
erties determine uniquely what type of microsystem is produced. Let us call this 
description empirical. It is important that the classical properties defining a source 
do not include time and position so that the source can be reproduced later and 
elsewhere. We call different sources that are defined by the same classical properties 
equivalent. Empirical description is sufficient for reproducibility of experiments but 
it is not sufficient for understanding of how the sources work. If a source defined 
by an empirical description is set into action, we have an instance of the so-called 
preparation. 

Quantum mechanics assumes that these are general features of all sources, inde- 
pendently of whether they are arranged in a laboratory by humans or occur spon- 
taneously in nature. For example, some classical conditions in the centre of the Sun 
lead to emission of neutrinos. 

Often, a source yields very many microsystems of a given type that are emitted 
in all possible directions, a kind of radiation. We stress that the detailed structure 
of the radiation as it is understood in classical physics, that is where each individual 
classical system exactly is at different times, is not determined for quantum systems 
and the question has even no sense. Still, a fixed source gives the microsystems 
that originate from it some properties. In quantum mechanics, these properties are 
described on the one hand by the structural properties that define the prepared type, 
on the other, e.g., by the so-called quantum state. We call a prepared quantum state 
quantum object. One preparation creates one individual quantum object. However, 
a statement about a quantum object make sense only if it is independent of the 
individuality of the quantum microsystem that occupies the corresponding state. 

To determine the quantum state that results from a preparation with a given 
empirical description in each specific case requires the full formalism of quantum 
mechanics. Hence, we must postpone this point. 
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After arranging the source, another stage of the experiment can start. Generally, 
only a very small part of the radiation from a source has the properties that are 
needed for the planned experiment. The next step is, therefore, to select the part 
and to block off the rest. This is done by the so-called collimator, mostly a set of 
macroscopic screens with apertures and macroscopic electric or magnetic fields. For 
example, the electron radiation can go through an electrostatic field that accelerates 
the electrons and through electron-microscope 'lenses', each followed by a suitable 
screen. A narrow part of the original radiation, a beam, remains. For example, a 
beam of molecules obtained from an oven can also contain parts of broken molecules 
including molecules with different degrees of ionisation. The part with suitable 
composition can then be selected by a mass spectrometer and the rest blocked 
off by a screen. Again, the beam resulting from a raw source and a collimator 
consists of individual quantum objects with a well-defined type and quantum state. 
The process of obtaining these individual quantum objects is another example of 
preparation. Again, there is an empirical description that defines an equivalence 
class of preparations and equivalent preparations can be repeated. 

The final beam can be characterized not only by the quantum state of individual 
objects but also by its approximate current, that is how many individual objects it 
yields per second. The beam can be made very thin. For example, in the electron- 
diffraction experiment [67J, the beam that emerges from the collimator represents 
an electric current of about 10~ 16 A, or 10 3 electrons per second. As the approximate 
velocity of the electrons and the distance between the collimator and the detector 
are known, one can estimate the average simultaneous number of electrons there at 
each time. In the experiment, it is less than one. One can understand in this way 
that it is an experiment with individual electron objects. 

Next, the beam can be lead through further arrangement of macroscopic bodies 
and fields. For example, to study the phenomenon of diffraction of electrons, each 
electron object can be scattered by a thin slab of crystalline graphite or by an 
electrostatic biprism interference apparatus. The latter consists of two parallel plates 
and a wire in between with a potential difference between the wire on the one hand 
and the plates on the other. An electron object runs through between the wire and 
both the left and right plate simultaneously and interferes with itself afterwards (for 
details see [Q7\). Again, the beam from the graphite or the biprism can be viewed 
as prepared by the whole arrangement of the source, collimator and the interference 
apparatus. This is another example of preparation procedure. 

Finally, what results from the original beam must be made directly perceptible 
by its interaction with another system of macroscopic bodies and fields. This pro- 
cess is called registration and the system registration apparatus. The division of an 
experimental arrangement into preparation and registration parts is not unique. For 
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example, in the electron-diffraction experiment, one example of a registration ap- 
paratus begins after the biprism interference, another one includes also the biprism 
interference apparatus. Similarly to preparations, the registrations are defined by 
an empirical description of their relevant classical properties in such a way that 
equivalent registrations can be repeated. 

An important definition property of a registration is that it is applicable to an 
individual quantum object and that each empirical result of a registration is caused 
by just one individual object. In the above experiment, this assumption is made 
plausible by the extreme thinning of the beam, but it is adopted in general even if 
the beam is not thin. 

An empirical description of a registration apparatus can determine a quantum 
mechanical observable. Again, more theory is needed for understanding of what 
are observables and how they are related to registration devices. Each individual 
registration performed by the apparatus, i.e., registration performed on a single 
quantum object, then gives some value of the observable. The registration is not 
considered to be finished without the registration apparatus having given a definite, 
macroscopic and classical response. 

A part of registration apparatuses for microsystems are detectors. At the em- 
pirical level, a detector is determined by an arrangement of macroscopic fields and 
bodies, as well as by the chemical composition of its sensitive matter [33]. For ex- 
ample, in the experiment [67] on electron diffraction, the electrons coming from the 
biprism interference apparatus are absorbed in a scintillation film placed transver- 
sally to the beam. An incoming electron object is thus transformed into a light 
signal. The photons are guided by parallel system of fibres to a photo-cathode. The 
resulting (secondary) electrons are accelerated and lead to a micro-channel plate, 
which is a system of parallel thin photo-multipliers. Finally, a system of tiny anodes 
enables to record the time and the transversal position of the small flash of light 
in the scintillation film. In this way, each individual electron object coming from 
the biprism is detected at a position (two transversal coordinates determined by the 
anodes and one longitudinal coordinate determined by the position of the scintilla- 
tion film). This triple of numbers is the result of each registration and the value 
of the corresponding observable, which is the position in this case. Also the time 
of the arrival at each anode can be approximately determined. Thus, each position 
obtains a certain time. 

An important observation is the following. When an electron that has been 
prepared by the source and the collimator hits the scintillation film, it is lost as an 
object. Indeed, there is no property that would distinguish it from other electrons 
in the scintillation matter. Thus, this particular registration is a process inverse 
to preparation: while the preparation has created a quantum object with certain 
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individuality, the registration entails a loss of the object. Our theory of quantum 
registration in Chap. 6 will generalise this observation. 

If we repeat the experiment with individual electron objects many times and 
record the transversal position coordinates, the gradual formation of the electron 
interference pattern can be observed. The pattern can also be described by some 
numerical values. For example, the distance of adjacent maxima and the direction 
of the interference fringes can be such values. Still, the interference pattern is not a 
result of one but of a whole large set of individual registrations. 

In some sense, each electron object must be spread out over the whole plane of 
the scintillation detector after coming from the biprism but the excitation of the 
molecules in the detector matter happens always only within a tiny well-localized 
piece of it, which is different for different objects. Thus, one can say that the 
interference pattern must be encoded in each individual electron object, even if it is 
not possible to obtain the property by a single registration. The interference pattern 
can be considered as an objective property of the individual electron objects prepared 
by the source, the collimator and the biprism interference apparatus. On the other 
hand, the hitting position of each individual electron object cannot be considered as 
its objective property. Such an assumption would lead to contradictions with results 
of other experiments. The position must be regarded as created in the detection 
process. The interference pattern is not a structural property: preparations that 
differ in the voltage at some stage of the experiment (e.g., the accelerating field in 
the collimator or the field between the wire and the side electrodes in the biprism 
interference apparatus, etc.) will give different interference patterns. We call such 
objective properties dynamical. 

Some structural properties can be measured directly by a registration (on individ- 
ual quantum objects) and their values are real numbers. For example, mass can be 
measured by a mass spectrometer. Such structural properties can be described by 
quantum observables (see Sec. 1.2)0. However, there are also structural properties 
that cannot be directly measured on individual objects similarly to the interference 
pattern, such as cross sections. They cannot be described by observables: the value 
of a cross section is not a value of an observable. 

0.1.3 Realism 

The fact that the description at empirical level is sufficient for reproducibility of 
experiments and for the account of their results has been used to construct a kind 
of realist interpretation of quantum mechanics. This interpretation maintains that 

2 These observables must commute with all other observables ([48], IV. 8), and can be associated 
with the so-called superselection rules, see Sec. 3.3. 
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the domain of real world to which quantum mechanics is applicable are exactly 
the classical properties of macroscopic devices described at the empirical level. A 
rigorous formulation is Ref. [H]. The macroscopic systems are not considered as 
quantum systems. Such interpretation does not even need to assume that there are 
any microsystems [IS]. This is in agreement with statements such as Bohr's 'there 
is no quantum concept', see Ref. [17], pp. 33-57. 

There are two reasons for such a cautious approach. First, the values of observ- 
ables concerning a quantum object S that are read off as the classical properties of a 
macroscopic device A after its interaction with the quantum object are not objective 
properties of the quantum object S. If one assumes that such values are the only 
'properties' any quantum systems can exhibit or that each objective property must 
be accessible by an individual registration, then quantum systems do not possess 
enough objective properties to be physical objects (for a mathematical proof, see 
Ref. [IS], p. 60). Second, there is neither a completely satisfactory quantum theory 
of all classical properties of macrosystems nor a generally accepted understanding 
of quantum measurement process as yet. 

The cautious approach has its drawbacks. The question naturally arises, what is 
the physics that prevents macroscopic systems to be quantum systems. After all, 
they are composite of quantum particles. The explanation offered in Refs. [1EJ US] 
by G. Ludwig is as follows. Ludwig's method is to postulate certain statements on 
preparation and registration first and then to derive quantum mechanics from it. 
For such method to be successful, some completeness of the starting set of prepa- 
ration and registration properties is needed. This, however, leads to problems: the 
macroscopic systems described by classical theories yield examples that do not ad- 
mit such completeness (see also Chap. 5 of the present paper). Thus, if preparations 
and registrations are to form the domain of reality of a macroscopic system that is 
to be described by quantum mechanics, then quantum mechanics is not applicable 
(see Ref. [IS], Vol. 2, p. 16). On the one hand, the argument underpins the idea 
that the cautious approach is self-consistent, on the other, it only shows that the 
approach itself is inapplicable to macroscopic systems and leaves the question open 
whether there is another more comprehensive approach. 

These and other problems motivate our attempt at an alternative theory. Sur- 
prisingly enough, another realist interpretation that is compatible with all physical 
principles of standard quantum mechanics is possible. The only difficulty are cer- 
tain prejudices and restrictions motivated, in fact, by classical mechanics. Thus, 
our deviation from classical thinking (with the exception of realism) is even greater 
than that of other interpretations of quantum mechanics. 

The general hypothesis of realism says that the world is not just an idea in our 
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mind but does really exiscl This is an utterly metaphysical hypothesis, which 
means that it cannot be disproved by any observational facts. Only if some further 
assumptions about the world are involved then it is indeed possible to prove in some 
cases that such a world does not exist. 

Our approach to properties of quantum systems is necessarily different from those 
that can be found in literature in order that it can circumvent some no-go theorems 
concerning objective properties (see, e.g., Ref. [IE]). First, we extend the notion of 
properties to include complex ones in the following sense: 

1. Their values may be arbitrary mathematical objects (sets, maps between sets, 
etc.). For example, the Hamiltonian of a quantum system involves a relation 
between energy and some other observables of the system. This relation is an 
example of such a complex property. 

2. Their values need not be directly obtained by individual registrations. For 
example, to measure a cross-section a whole series of scattering experiments 
must be done. Thus, their values need not possess probability distribution but 
may be equivalent to, or derivable from, probability distributions. 

Second, we weaken the relation of objective properties to the basic quantum-mecha- 
nical concepts of preparation and registration. What is objective? A popular opinion 
is ([48J, p. 60): 

An objective property should refer directly to the microsystem itself and 
be independent of preparation and registration process. By this we mean 
that a set a [of systems] which is selected by a preparation (and similarly 
a set b which is selected by a registration) may be divided according to 
objective properties into subsets... 

The 'independence' does not mean that properties are not manipulable or observable. 
In fact, if we look at Newton mechanics, a mass point can be put into a space point x 
with a momentum p so that it is then objectively in the state (x,p), and measurement 
procedures exist that can inform us in which state it objectively is in the moment of 
measurement. One can say that all classical objective properties can be prepared and 
can be obtained as results of measurements. The problem in quantum mechanics 
is that a registration in most cases disturbs the microsystem so that the result of 
the registration is only created during the registration process. The result of the 
registration cannot thus be assumed to refer directly to the microsystem and to 
be independent of the registration process. Hence, a set of prepared microsystem 

3 Of course, we are a part of the world and can influence it. Thus, the world is objective but 
not completely independent of us. 



12 



objects cannot already be subdivided into subsets according to the properties that 
will only be registered later. 

It is clear then that the objective properties of microsystems, if there are any, 
cannot be directly related to registrations. To confirm an objective property, many 
registration procedures often carried out by several registration devices must be 
done on many copies of equally prepared objects. 

Any criterion of objectivity must therefore ignore registration. We are thus led 
to the following: 

Basic Ontological Hypothesis of Quantum Mechanics A property is objective 
if its value is uniquely determined by a preparation according to the rules of standard 
quantum mechanics. The 'value' is the value of the mathematical expression that 
describes the property and it may be more general than just a real number. No 
registration is necessary to establish such a property but a correct registration cannot 
disprove its value; in many cases, registrations can confirm the value. 

The motivation for the hypothesis is that such a property can be assumed to 
be possessed by the prepared object without either violating any rules of standard 
quantum mechanics or contradicting possible results of any registration performed 
on the prepared object. Any set of equally prepared objects can, therefore, always 
be subdivided into subsets according to such objective properties. It follows that 
the properties satisfy Boolean lattice rules. The relation of registrations to such 
objective properties must be studied for each property separately. 

We shall discus the objective properties in more detail in Chap. 1 and see that 
they can be divided into structural and dynamical. Already now, we stress: with the 
help of the above criterion, we obtain enough objective properties to characterize 
quantum systems completely (at least from the standpoint of standard quantum 
mechanics). Moreover, there will be objective properties of quantum systems that 
can play the role of classical properties of macroscopic quantum systems. 

A few words have to be said on ontological hypotheses. As it is well-known, the 
objective existence of anything cannot be proved (even that of the chair on which 
I am now sitting, see, e.g., Ref. [22], where this old philosophical tenet is explained 
from the point of view of a physicist). Thus, all such statements are only hypotheses, 
called ontological. However, an ontological hypothesis may lead to contradictions 
with some observations; exactly that happens if one tries to claim objectivity of 
quantum observables. Hypotheses that do not lead to contradictions may be useful. 
For example, the objective existence of the chair nicely explains why we all agree on 
its properties. Similarly, the assumption that quantum systems can possess some 
objective properties can be useful for the quantum theory of classical properties 
or for a solution of the problem of quantum measurement. The fact that a realist 
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version of quantum mechanics appeals to our intuition is not essential, but it is a 
bonus. 
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Part I 

General notions and rules 
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As announced in the Introduction, we are going to divide quantum mechanics 
into two parts. The first one contains general notions and rules as well as the 
corresponding mathematical formalism and its physical interpretation, while the 
second is a menagerie of models. 
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Chapter 1 

States and observables 



This chapter describes the first part of the general language of quantum mechanics 
concerning the notions and most important properties of states, observables and 
their relation to space-time transformations. We shall use some of the nomenclature 
introduced in Refs. [HI [16] but no previous knowledge of these books is assumed. 
Ref. [IE] lists certain axioms first that must be satisfied by preparations and regis- 
trations and then derives quantum mechanics from these axioms. Our way will be 
the opposite one, similar to that of Ref. [16] : we assume the basic notions and rules 
of quantum mechanics first and derive general consequences for preparations and 
registrations later. 

1.1 States 

A preparation yields quantum object of certain type in a state while a registration 
results in a value of an observable. What is the type and what is the state is uniquely 
determined by the classical properties of the preparation and what is the observable 
is uniquely determined by the classical properties of the registration. 

In quantum mechanics, the states and observables are described by specific mathe- 
matical entities. If we say 'state' or 'observable', we always mean these entities. It is 
important to distinguish between the fact of being uniquely determined and the way 
from the empirical description to a particular mathematical state or observable. The 
calculation of a state from classical conditions defined by the preparation needs full 
quantum mechanics. Similarly, to calculate an observable from classical properties 
of the registration device, quantum mechanics must be used. At this stage of expo- 
sition, we just assume that the classical properties determine states and observables 
and consider the way of how they do it as belonging to the model part of the theory. 
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1.1.1 Mathematical preliminaries 

We assume that the reader is already more or less acquainted with most of the 
necessary mathematics and the purpose of the following text is mainly to settle the 
notation. 

Let H be a complex separable Hilbert space with inner product (-|-) satisfying 
(a(f>\bip) = a*b{<fi\vjj) : where V denotes complex conjugation. An element G H is a 
unit vector if its norm defined by 

U\\ = (010) 

equals one, 



= 1 , 

and the non-zero vectors 0, ip e H are orthogonal if 

(</#> = . 

A set {4>k\ C H is orthonormal if the vectors (pk are mutually orthogonal unit 
vectors. {<fi k } C H, k — 1, 2, • • • , is an orthomormal basis of H if any ip e H can be 
expressed as a series 



with 



k 

Separability means that there is at least one countable basis. 

Let H and H' be two separable Hilbert spaces, {4>k} and {<f)' k } two orthonormal 
bases, {4>k} C H and C H'. Let us define map U : {4>k} >->■ {<p' k } by 

U0 fc = <fr' k 

for each k. Then, U can be extended by linearity and continuity to the whole of H 
and it maps H onto H'. The map U is called unitary. Unitary maps preserve linear 
superposition, 

U(a^ + 60) = aU^ + 6U0 

and inner product, 

(u^iu^) = (m . (i.i) 

They can be defined by these properties for general Hilbert spaces and used as 
equivalence morphisms in the theory of Hilbert spaces. Each two separable Hilbert 
spaces are thus unitarily equivalent. 
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Any unit vector G H determines a one-dimensional (orthogonal) projection 
operator P[0] by the formula 

P[0]^ = (0|^>0 

for all ip G H. We also use the Dirac notation |0)(0| for this projector. If {<pk} is 
an orthonormal basis of H, then the projection operators P[0&] satisfy 

for all k 7^ /-we say they are mutually orthogonal-and 

k 

where 1 is the identity operator on H. 

A linear operator A : H i— )■ H is defined by the property 

A(a0 + bip) = aA0 + b/kip 

for all 0, i\) G H for which the right-hand side makes sense. A linear operator A is 
called bounded if its norm 

||A|| = sup ||A^|| (1.2) 

is finite. Let us denote the set of all bounded linear operators by L(H). Bounded 
linear operators can be multiplied, 

(AB)0 = A(B0) , 

and linearly combined, 

(aA + 6B)0 = a(A0) + 6(B0) 

for any a, b G C. L(H) is a Banach algebra with the norm (11.21) . 
The adjoint A^ of bounded operator A is defined by 

(AVl^> = (0|A^> 
for all 0, ip G H, and A is self-adjoint (s.a.) if 

A f = A . 

Linear combinations of bounded s.a. operators with real coefficients are again s.a., 
but their products are not, in general. Hence, the set of all bounded s.a. operators 
forms a real linear space. If completed with respect to the norm (jl.2p . it is a Banach 
space denoted by L r (H). 
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Unitary maps U : H i— > H are bounded operators and we obtain from Eq. (11.11) 

ut ■ u = U • U f = 1 . 

Let H and H' be two separable Hilbert spaces and U : H y H' be a unitary 
map. Then U defines a map of L(H) onto L(H') by A i— > UAU^. This map preserves 
operator action, 

(UAU f )(U0) = U(A0) , 



linear relation, 
operator product 
and norm, 



U(aA + 6B)U + = aUAU f + &UBU f 
(UAU t )(UBU t ) = U(AB)U f 



IUAII = IIAI 



on L(H). The Banach-space structure of L r (H) is also preserved by unitary maps. 
An operator A G L r (H) is positive, A > 0, where is the null operator, if 

(0|A0) > 

for all vectors G H. The relation A > B defined by 

A- B > 

is an ordering on this space. With this (partial) order relation, L r (H) is an ordered 
Banach space. The order relation is preserved by unitary maps, 

UAU f > UBU f if A > B . 

Let {4>k} be any orthonormal basis of H. For any A G L(H), we define the trace 

tr[A] = ^(0 fc |A0 fc ) . 

k 

Trace is independent of basis and invariant with respect to unitary maps, 

*r[UAU f ] = tr[A] . 



Proposition 1 Trace defines the norm ||A|| S on L r (H) by 

\\A\\ S = tr \y/M 

satisfying 

l|A|| s > ||A|| 

for all A G L r (H) 



;i.3) 
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For proof, see Ref. [IB], Appendix IV. 11. 



Definition 1 The norm U.ty) is called trace norm and all s.a. operators on H with 
finite trace norm are called trace-class. The set of all trace-class operators is denoted 
6yT(H). 

Trace norm is preserved by unitary maps. 

Proposition 2 T(H) completed with respect to the norm ( [Op is an ordered Ba- 
nach space. A trace-class operator is bounded, its trace is finite and its spectrum is 
discrete. 

For proof, see Ref. [IB], Appendix IV. 11. 

Let T(H)^ be the set of all positive self-adjoint trace-class operators on H with 
trace 1. As these operators are positive, their trace is equal to their trace norm and 
they lie on the unit sphere in T(H). T(H)|" is a convex set: let Ti,T2 G T(H)^, 
then 

wTx + (l-w)T 2 G T(H)+ 
for all < w < 1. It follows that any convex combination 

J = Y^w k J k (1.4) 

k 

of at most countable set of states G T(H)|~ with weights uik satisfying 

and the series converging in the trace- norm topology also lies in T(H)^. In general, a 
given state T(H)|~ can be written in (infinitely) many ways as a convex combination. 
All possible components of such convex combinations form a so-called 'face' in the 
convex set T(H T )+ (cf. [IB], p. 75). 

An element T is called extremal element of T(H)^ if it lies in a zero- dimensional 
face, i.e., if the condition 

T = wT x + (1 - w)T 2 

with Ti,T2 G T(H) ] f and < w < 1, implies that T = Ti = T2. Any extremal 
element has the form P [</>], where is a unit vector of H and, conversely, any 
projection onto a one-dimensional sub-space of H is extremal. The set of all extremal 
elements of T(H)] 1 " generates T(H)^" in the sense that any T G T(H)^ can be 
expressed as countable convex combination of some extremal elements, 

T = ^w fc P[0 fc ] . 

k 
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Such a decomposition can be obtained, in particular, from the spectral decomposi- 
tion 

T = ^ t k P k . 

k 

In that case T is decomposed into mutually orthogonal one dimensional projectors 
P[<f>i], with the eigenvalues representing the weights wi, each term appearing as many 
times as given by the dimension of the eigenspace. 

A unit vector of H defines a unique extremal element P[<j>] G T(H)+ but P[0] 
determines (j> on ly U P to a phase factor e ia . Due to the (complex) linear structure of 

H, there is an operation on vectors called linear superposition. Linear superposition 
ip = ^2 Ck<t>k of unit vectors with complex coefficients satisfying 

£ = 1 

k 

is another unit vector and the resulting projector P[ip], 

= *52 c * c k\<f>k)((f>i\ ^ M 2 |0fc)(0fcl • ( L6 ) 

kl k 

is different from the corresponding convex combination, 

^N 2 |0 fc >(0 fc | . (1.7) 
k 

Observe that P[ip] is not determined by the projectors P[<f>k] because it depends on 
the relative phases of vectors 4> k . 

I. 1.2 General rules 

With each quantum system S of type r, a complex separable Hilbert space H T is 
associated. H T carries a further structure dependent on r, such as a representation 
of Galilean group (see Sec. 1.3) and a tensor-product structure (see Sees. 2.1 and 
2.2). 

Rule 1 The states of a quantum system S of type r are elements o/T(H T )^. Every 
preparation creates a quantum object of a definite type and a definite state. The state 
is an objective property of the prepared system. In principle, each element o/T(H T )^ 
can be prepared as a state of a system of type r. 

Textbooks of quantum mechanics do not consider states as objective but this as- 
sumption is crucial for our interpretation. The objectivity of states however does 
not mean that each system of type r is in some state from T(H T )^ even if it does 




^2 c^k ) ( c ^ 
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not been so prepared (see Sec. 2.2.1). Rule 1 does not exclude that one and the same 
state can be obtained by different preparations. Methods of how states prepared by 
two different preparations can be compared will be described Sec. 1.2. 

Definition 2 Let T be a finite convex combination ( |i.^| ). In the case that the prepa- 
ration V[T] of J includes sub -preparations V\Tk] of each with relative weights Wk 
(independently of whether the mixing is done by humans or by nature; in practice, 
it is achieved by mixing beams), the decomposition fli.^p is selected by the prepara- 
tion ofT. Let us call the resulting state gemenge, the states components of the 
gemenge. 

A gemenge is also called direct mixture [H] or proper mixture [22] ; the word gemenge 
is taken from Ref . (16] . Gemenge concerns a physical property of preparation rather 
than any mathematical one of the right-hand side of Eq. ( II. 4p (such as being 
vector states or being mutually orthogonal, etc). From the mathematical point 
of view, many different convex decompositions of a general state T may exist. A 
preparation of T selects one of the mathematically possible convex decompositions 
of T. 

A random mixture of preparations is not uniquely determined by the preparation 
process. It can be coarsened or refined, i.e., some of "P[TY] can be combined into one 
preparation procedure or "P[Tfc] for some k can itself be a random mixture of other 
preparations P[T fc /]. 

Definition 3 The finest convex decomposition of state T defined by its preparation 
as gemenge is called gemenge structure of J. 

Thus, gemenge structure of T is uniquely determined by its preparation. For ex- 
tremal states, there is always only one gemenge structure, the trivial one, indepen- 
dently of how it was prepared. 

It may be advantageous to distinguish the mathematical convex combination of 
states from their gemenge by writing the sum in Eq. ( II .4p as follows 

t = r£j WkTk ( L8 ) 

in the case that the right-hand side is a gemenge structure of T. 

A gemenge contains more information about an object than its state operator 
does and it is, therefore, viewed as a refinement of the state concept. 

Proposition 3 The individual objects prepared in a state with gemenge structure 
M.8\) can be assumed always to objectively be in either of its component states T fc 
with respective probabilities Wk- 
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It also follows that a state itself even if it is prepared as a gemenge with non- 
trivial structure (11.81) is an objective property of any individual object prepared in 
this way. On the one hand, this is in agreement with our objectivity criterion, on the 
other it is completely analogous to classical properties which need not determine the 
state of a classical system uniquely, either. The states and their gemenge structures 
are objective properties (being determined by preparations) that we have called 
<i?/narmca0. We shall later encounter more examples of dynamical properties, but 
many will be derived from, or determined by, states. 

Set T(H r )^" of states can be viewed as analogous to phase space of Newtonian 
mechanics. Each point of T(H T ) ] f is analogous to a point in the phase space if it is 
prepared so that it has a trivial gemenge structure. The gemenges are apparently 
analogous to the probability distributions over the phase space. 

Let a classical system <S C be subject to physical conditions C that do not uniquely 
determine a point Z in its phase space but a whole probability distribution p[Z) 
on the phase space. Then, on the one hand, p(Z) is an objective property of each 
individual system S c subject to C and, on the other, each individual S c is usually 
assumed also to be objectively at a particular point Z (but cf. Chap. 5), which is, 
therefore, also its objective property. Thus, the two different states, p{Z) and Z, are 
simultaneously objective properties of the particular system S c . This is, of course, 
possible, because the two properties are logically compatible. 

The situation in quantum mechanics is completely analogous, but there is one 
difference. In Newtonian mechanics, point Z and non-trivial distribution p[Z) are 
mathematically different. In quantum mechanics, one and the same element T 6 
T(H r ) ] f can have both a non-trivial and a trivial gemenge structure. This depends 
on its preparation. Mathematical properties of T can restrict its possible gemenge 
structures, e.g. extremal states allow only the trivial one. 

Any unit vector e H T defines state P [<f>]. Such a state is called vector state 
(often, such states are called 'pure'). The states spoken about in Rule 1 are not 
necessarily vector states. However, vector states have a number of very interesting 
properties. In some special cases, vector states P[(f>k], k — 1, • • • ,N, can be prepared 
in such a way that their relative phases are also fixed. Such set of states is called 
coherent. Then, the linear combination of the vectors defines the state (11 .6p . P[ip], 
which is called linear superposition of the vector states P [</>&], k = 1, ■ ■ • , N. This 
leads to interference phenomena that are purely quantum and unknown in the New- 
tonian mechanics such as the electron interference in the Tonomura [67J experiment 
(see Sec. 1.2.2). 

The dynamical properties of quantum objects form a Boolean lattice similarly 
to the structural properties. This is to be understood in the following sense (cf. 

1 In Ref. [29], they were called 'conditional'. 
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Ref. [IS]). There is a set M of entities and a class N of possible properties of the 
entities. All entities having a given property form a subset that can be identified 
with the property. The family of all subsets form a Boolean lattice with the order 
relation, disjunction, conjunction and complement being the set-theoretical inclu- 
sion, intersection, union and complement in M. This is analogous to the properties 
of a system in Newtonian mechanics: All systems of a given type form M. Each 
element of M is objectively in one point of the phase space of the system. Then, 
properties can be identified with subsets of the phase space. 

The set of entities M that are considered in the present case consists of all quan- 
tum objects of the same type r. (Thus, they all have the same structural properties.) 
All states of any such object S that can be prepared form set N = T(H T )^. Any 
A C T(H r )^ represents all copies of S that are prepared in the states contained in 
A§ Another example of N is the set of all gemenge structures (II. 8ft of a given state. 

There is no finer or more complete description of a quantum system than by its 
gemenge structure. This implies that a prepared quantum system can be considered 
as a physical object in our theor}]^. 

Some of the above statements seem to contradict the well-known fact that the 
subset of states consisting of all possible projections (quantum logic) does not form 
a Boolean lattice. In Sec. 3.2.2, the lattice structure of the set of projections will be 
described. It is clear that there is no contradiction because the lattice operations 
that are used in quantum logic are different from those introduced above. The deep 
reason is that the properties of quantum logic are associated with registrations and 
are, therefore, not objective, while our properties are determined by preparations 
and can be considered as objective. 



1.2 Observables 

Once an object in a state has been prepared, one can carry out a registration on it. 
Mathematically, a registration is described by an observable. A better understanding 
of the notion of state can be achieved if results of all possible registrations applied 
to the state can be calculated. 

2 Observe that the Boolean lattice of subsets of N has nothing to do with the Boolean lattice of 
preparation procedures as defined in Ref. [48] : the lattice operations are defined in a different way. 

3 In Ref. [48] . p. 60, the problem is discussed whether a quantum system can be considered 
as a physical object. The criterion is that each state of the system is uniquely determined by its 
objective properties. Of course, this depends on what is a property. In Ref. [48], only values of 
observables are admitted as properties. Then, quantum systems are not physical objects. In our 
interpretation, the class of properties is much broader and certain quantum systems are, then, 
physical objects. 
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1.2.1 Mathematical preliminaries 

Definition 4 Let F be the Boolean lattice of all Borel subsets ofW 1 . A positive 
operator valued (POV) measure 

E : F ^ L r (H) 

is defined by the properties 

1. positivity: E(X) > for all X e F , 

2. a-additivity: if {X^.} is a countable collection of disjoint sets in F then 

E(U k X k ) = E(X fc ) , 

k 

where the series converges in weak operator topology, i.e., averages in any state 
converge to an average in the state. 

3. normalisation: 

E(IR n ) = 1 . 

The number n is called dimension of E. Let us denote the support of the measure E 
by f2. The set Q is called the value space of E. The operators E(X) for X e F are 
called effects. 

The support f2 of measure E is defined as follows 

n = {x G W n \E({y\\x -y\ < e}) ^ Ve > 0} . 

Let H and H' be two separable Hilbert spaces and U : H i— y H' a unitary map. 
Then UEU t : F ^ L(H') is a POV measure on H'. 
We denote by L r (H)< 1 the set of all effects. 

Proposition 4 L r (H)< 1 is the set of elements o/L r (H) satisfying the inequality 

< E(X) < 1 . (1.9) 

That effects must satisfy (\l.9\\ follows from the positivity and normalisation of a 
POV measure. On the other hand, each element of the set defined in Proposition 
4 is an effect. For example, if E is such an element, we can define E({1}) = E and 
E({— 1}) = 1 — E. The two operators satisfy Eq. ( 11. 91) and they sum to 1, so they 
determine a POV measure with the value set Q = { — 1, +1}. 

Proposition 4 implies that the spectrum of each effect is a subset of [0, 1]. An 
effect is a projection operator (E(X) 2 = E(X)) if an only if its spectrum is the 
two-point set {0, 1}. 



26 



Proposition 5 For any POV measure E : 
are equivalent 

E{Xf = 

for all X G F and 

E(X n Y) = 
for all X, Y e F . 



F i — y L(H) i/ie following two conditions 
E(X) 

E(X)E(V) 



Thus, a POV measure is a projection valued (PV) measure exactly when it is mul- 
tiplicative. In this case, all effects commute with each other. 

PV measures for n = 1 are equivalent to s.a. operators that need not be bounded. 
Sec. 1.1.1 introduced only bounded s.a. operators and we need a more general defi- 
nition. 

Let A be an operator on Hilbert space H that is not necessarily bounded. Then it 
is not defined on the whole of H but only on linear subspace Da that is dense in H 
and is called domain of A. The definition of adjoint has two steps: first, At is defined 
on Da and then extended. Similarly the definition of the self-adjoint operator, see 
e.g. Ref. [63] . 

To define a sum and product of two unbounded operators A and B, their domains 
are used as follows. If 

D A = D B 

then we say that A and B have a common domain. If, moreover, the common domain 
D is invariant with respect to both operators, i.e., 

AD C D , BD C D 

then the sum A + B and product AB are well defined on D and can be possibly 
extended. 

For s.a. operators, bounded or unbounded, the so-called spectral theorem holds, 
see Ref. [63]. This says that any s.a. operator A is equivalent to a PV measure, 
which is called, in this case, the spectral measure of A. Let E be a PV measure, then 
E determines a unique self-adjoint operator f R idE, where i denotes the identity 
function on R. Conversely, each s.a. operator A on H determines a unique PV 
measure E:F4 L(H) such that 

A = / idE . (1.10) 

If PV measure E is equivalent to a s.a. operator A, we shall denote it E A . Thus, 
POV measure is a generalisation of a self-adjoint operator. 

Another important special case is a discrete POV measure. A POV measure E is 
discrete if its value set Q is an at most countable subset of R. Let E be a discrete 
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POV measure and let f2 = {a^, k G N}, where N is the set of positive integers. Then 
E is a PV measure, E = E A , A = J^ fc a^k, a>k are eigenvalues of A and 

E fc = E({a fc }) 

are projections on the corresponding eigenspaces of the s.a. operator A that is defined 
in this way. 

For any POV measure E : F i— > L(H) and any T G T(H)^, the mapping 

Pt:F^[0,1] 

defined by 

p^(X) = tr[TE(X)] (1.11) 

for all X G F is a real a-additive probability measure with values in [0,1]. This 
follows from the defining properties of E and the continuity and linearity of the 
trace. The measure is preserved by unitary maps, 

tr[(UTU t )(UE(X)U t )] = tr[TE{X)] . 

We note that a convex combination of states induces a convex combination of 
measures, 

T = WkTk ^Pt = Y1 Wkp T* ■ 

k k 

More about mathematical properties of states and effects and the corresponding 
spaces T(H)+ and L^H)^ can be found in Ref. [48] . 

1.2.2 General rules 

In this subsection, we shall again work only with systems of definite type r. 

Rule 2 Any quantum mechanical observable for system S of type r is mathemati- 
cally described by some POV measure E : F 4 L r (H T ). Each outcome of an indi- 
vidual registration of the observable E performed on an object S yields an element 
o/F. Each registration apparatus determines a unique observable. 

Often, a stronger assumption is made, namely that each element of L r (H T ) is a 
sharp observable of system S of type r. Such systems are called proper quantum 
systems. In Sees. 2.2.2 and 6.2, we shall show that, in strict sense, no quantum 
system is proper. Usually, the construction of a model needs only few observables 
and, for a given model, our theory of quantum measurement will itself determine 
which POV measures cannot be observables. 

An important assumption of quantum mechanics is the following generalisation 
of Born's rule 
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Rule 3 The number Pj(X) defined by Eq. U.ll\) is the probability that a registration 
of the observable E performed on object S in the state T leads to a result in the set 
X. 

Using the same preparation N times gives always the same state T. Performing 
in each case the registration measuring E results in a value (or in an approximate 
value) y G Q. The relative frequency of finding y G X approaches Pj{X) if N — > oo. 
Hence, the observable is associated with an apparatus and the corresponding effects 
with subsets of values that can be obtained by registration processes. 

Proposition 6 If Ti and T 2 from T(H. T )f satisfy 

tr[J 1 E]=tr[J 2 E] 

for all E G L r (H T )< 1 then T x = T 2 ; if Ei and E 2 from L r (H T )< 1 satisfy 

fr-fTEx] =tr[TE 2 ] 
for all T G T(H r )+ then E x = E 2 . 

Thus, a state T can be determined uniquely, if it is prepared many times and a 
sufficient number of different registrations can be performed on it. As tr[ET] is 
independent of the gemenge structure of T, the question of what is its gemenge 
structure cannot be decided in this way with one exception: vector states. A vector 
state is extremal so that it cannot be a non-trivial gemenge. 

In Rule 3, the probability Pj{X) refers to measurements on an object and not to 
the object itself. In general, the property that value x G Q lies within subset X G F 
is not an objective property of objects in the sense that it could be attributed to an 
object itself and that the measurement would just reveal it. Such assumption would 
lead to contradictions, see Sec. 3.2.2. The property can be called a relational in 
the sense that it does not concern the quantum object alone but only in interaction 
with a definite registration devic^E It can also be called incidental, because it is not 
predictable, except in special cases, see Sec. 3.2.2. 

Thus, there is an asymmetry between states and observables: states are objective 
properties but observables are not; all elements of T(H r )^ can be prepared but not 
all POV measures are observables. 

Special cases in which the outcome of a registration is predictable are described 
as follows. Let E : F i-4 L r (H T ) and let T G T(H T ) 1 f be such that 

tr[TE(X)\ = 1 

4 There also are relational properties that are objective. Consider a point particle in Newtonian 
mechanics. Its angular momentum with respect to the origin of coordinates is an objective property 
of the particle if the origin is uniquely determined. It is relational, because it depends on the choice 
of origin, not only on the particle. 
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for some X G F. Then the probability that the registration of E on T will give a 
value in X is 1 and we can say that the prepared object in the state T possesses the 
property independently of any measurement. 

Proposition 7 The condition tr[TE(X)] — 1 is equivalent to 

E(X)T = T . 
In general, we call state T satisfying 

E(X)T = aJ 

with some real a eigenstate of E(X) to eigenvalue a. If T = P[ip], then ip is called 
eigenvector. Thus, T in the Proposition 7 is an eigenstate of E(X) to eigenvalue 1. 

If a PV measure is an observable, we call the observable sharp. Let us consider 
a discrete observable A (which is always sharp) with spectrum {a k } that is non- 
degenerate, Pk = P^fe] f° r ai l k. ipk are eigenvectors of A. If we prepare the state 
P[ipk] and then register A, the result must be a k with probability 1. Next suppose 
that we prepare a linear superposition P[^f], 

oo 

1 

with ^2 \ck\ 2 = 1. The probability p k that the registration of A will give the result 
a k is 

p k = tr[P[^ k }P[^}} = \c k \ 2 . (1.12) 

Eq. (11.121) gives the physical meaning to the coefficients in the linear superposition 
and is called Born rule. Observe that the probability is only meaningful if it concerns 
really existing (but may-be unknown) outcomes, see Sec. 3.1. Thus, it can be only 
associated with the registration, not with the object in state P[^]. One can say that, 
before the registration, the object in this state possesses all values a k for which c k ^ 
simultaneously. 

This can be seen experimentally by means of the interference phenomena: the 
average in state P[^] of observable B that does not commute with A does depend 
on matrix elements of B between different states ip k : 

tr[BP[*]] = ^4Q(^|B|^> . 

kl 

Rule 3 implies useful formulae for averages and higher moments of E that gener- 
alise the well-known formulae for sharp observables. For the average (E) of observ- 
able E with n = 1 in the state T, we have 

(E) = / Ldfi . 
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Using Eq. (II. lip , we obtain 

(E) = f Ld{tr[TE)) =tr T / tdE 
Jr L Jm. 

For a sharp observable E A , Eq. (11.101) yields the usual relation 

(A) = tr[TA] . 

In the case of vector state, T = P [</>], we obtain (A) = (0|A0). For higher mo- 
ments, we have just to substitute suitable power of i in the integral and the proof 
is analogous. 

The values of an observable are not objective properties of an object but its states 
are. As averages and moments of an observable are uniquely determined by states, 
we have 

Proposition 8 The average (E) of any observable E in state T of object S is an 
objective property of S that has been prepared in state T. 

One may wonder how an average can be objective if the values from which the 
average is calculated are not. However, the value (E) of the average is uniquely 
determined by the preparation and (E) is fixed before any registration, whereas the 
outcomes of individual registrations are not. In fact, the values that are obtained 
by registrations must satisfy the condition that they sum to a given average. 



1.3 Galilean group 

In quantum mechanics, Galilean relativity principle holds in the following form: 

Rule 4 The same experiments performed in two different Newtonian inertial frames 
have the same results, i.e. give the same probability distributions. 

'The same experiment' means that all conditions of the first experiment have the 
same description with respect to the first inertial frame as those of the second 
experiment have with respect to the second frame. In the present section, we restrict 
ourselves to the proper Galilean group and work out some consequences of the 
principle. For example, the most important PV measures of quantum mechanics 
will be defined and the time evolution equation formulated. We just list the relevant 
facts; for motivation and technical detail, see, e.g., Ref. [56J. 

The group of transformations that leave the geometric structure of Newtonian 
spacetime invariant^ (see, e.g., Ref. [50], p. 296) is called Galilean group G. It 

5 We understand symmetry as transformation that leaves some well-defined structure invariant. 
For example, Galilean group contains all transformations that leave Newtonian spacetime geometry 
invariant and a symmetry of a quantum system leaves its Hamiltonian invariant. 
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is also the group of transformations between inertial frames. Let (xi, x 2 , X3, t) be 
a Newtonian inertial frame so that Xi,x 2 ,x 3 are Cartesian coordinates. A general 
element g(X) of the component of unity of G (called proper Galilean group, G + ) 
can be written in the form 

x T ^ Of T + a T + v T t (1.13) 

and 

t^t + \ 10 , (1.14) 

where is a proper orthogonal matrix determined by three parameters Ai, A2, A3, a 
is a vector of space shift with components A4, A5, Aq, v is a relative velocity of the 
frames with components A7, Ag, A9 and A10 is a time shift. The relations are written 
in matrix notation so that, e.g., x T is a column matrix with components of vector 

— * 

x. 

The group product g(\ 3 ) = (?(A 2 )^(A 1 ) is defined as the composition of the trans- 
formations g(\ 2 ) and ^(A 1 ) so that ^(A 1 ) is performed first and g(\ 2 ) second. Then, 

3 = 2 0! , 

aj = a J + 2 aj + vj\ l 



and 



10 ) 

vj = vj + 2 vj , 
X 3 — X 2 _l X 1 



HO — x uo ~ X U0 

Given a quantum system S of type r, classical apparatuses that are supposed 
to prepare and register S have well-defined Galilean transformations. There are 
then some conventions on the corresponding action on states and effects. Often, S 
is subject to influences external to S such as classical external fields. These also 
possess non-trivial transformation laws with respect to Galilean transformations. 
We call system S isolated, if the approximation that it is alone in the universe is a 
good one. Thus, there are no external influences. 

Let us consider a measurement that consists of a preparation and a registration 
of system S of type r by apparatuses A p and A r , respectively, and let there be some 
external fields /. Let A p prepare object S in state T and let A r register effect E(X). 
Next, we can transport both A p and A r by g G G to g(A p ) and g(A r )- Let T g j 
be the state prepared by g(A p ) and E g j(X) the effect registered by g(A r ). If the 
external influences are also transformed by g : / 1— >■ <?(/), we obtain state ~^ g , g (f) and 
effect Eg t9 ff)(X). The experiment has then been completely transferred by g and, 
therefore, Galilean relativity principle implies 
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Proposition 9 State T g , g {f) and effect E S)9 (/)(X) satisfy 

tr[JE(X)] = tr[T gMf) E 9Mf) (X)] (1.15) 
for allgeG,J e T(H r ) + and E{X) G L r (H r )^. 

Let us compare the two effects E(X) and E g ^^{X). In the case that Borel sets 
X have a well-defined transformation with respect to g, 7(g) : F n- F and that the 
two effects belong to the same observable, we assume that 

E 9 , g( /)P0 = E( 7 foP0 • 

Let us next compare measurable properties of the two states T and ~Tg, g (f)- Propo- 
sition 9 implies: 

tr[T gMf) E(X)]=tr[TE g - ljg - Hf) (X)} . 

Hence, values that can be registered on the state T 9 9 (j) are those on T transformed 
by g~ x . 

At this stage, one can formulate assumptions about the dependence of T g j and 
E g j(X) on / and about the way G acts on T(H T )^ and L r (H T )< 1 (see, e.g. Ref. 
[48J). Then, important theorems about action of G on H r can be proved. However, 
it is simpler and for our purposes sufficient to assume directly: 

Rule 5 Let S be an isolated system (f = 0) of type r within which no separation 
status changes happen (see Sec. 2.2.4) an d let G + denote the universal covering 
of proper Galilean group. Then, for each element g G G + there is a unique map 
\J(g) : H T 1 — y H T so that 

1. P[U(#] = P[4-i /ora^GH T) 

2. g 1 — y U(g) is a unitary ray representation of G + depending on t, 

3. for all T G T(H T )^ ; E(X) G L r (H T )^ and g G G + ; we have 

T g -i,o = U((/)TU((y)t , (1.16) 
E g - lfi (X) = U(g)E(X)U( g y . (1.17) 

The 'ray' representation means that U ((72(71) = Gxp[iu(gi, (?2)]U((72)U((?i). The repre- 
sentation depends of r. For example, the representation is irreducible for particles. 
Irreducible representations are classified by three numbers, fi, s and V, with the 
meaning of mass, spin and constant potential, respectively (see Ref. [16], p. 221). 
Clearly, the probability is preserved, 

tr[(\J(g)T\J( g y)(\J(g)E(X)U(gn = tr[JE(X)} 
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and we also have 

U(g)E(X)U( g y) = E(7(y)- 1 X) (1.18) 

(see Ref. [20J for generalisation of this relation). 

Now, the question arises: what is the form of spacetime transformations for 
systems that are not isolated? There are two different answers, one for Euclidean 
and one for time-translation subgroup of G. This is a part of the asymmetry between 
time and space in quantum mechanics. 



1.3.1 Euclidean group 

Let be the Euclidean subgroup of G. We have: 

Rule 6 Let S be a system of type r not necessarily isolated. Then the conclusion 
of Rule 5 is valid only for the universal covering of proper Euclidean group, G%, 
if f — is replaced by arbitrary f and 'unitary ray representation' is replaced by 
'unitary representation'. 

The specific form of the representation U(g) depends on the type of S and can 
be described in all detail only after model assumptions on S are made (see Chap. 
4). However, some general relations can be written already now. 

If we restrict ourselves to the subgroup of space translations then the correspond- 
ing element of Ge can be denoted by g{a) and its representative on H T by U(a). 
Similarly, each rotation can be described by unit vector n along its axis and angle 6 
of rotation in the counter-clockwise direction around the axis. Let the corresponding 
elements of the rotation subgroup be 0(n, 9) with representative U(n, 6). 

Given a one-parameter group of unitary operators U(A), then according to Stone's 
theorem, there exists operator G satisfying 

U(A) = exp(iGA) . 

It called of generator of group U(A) and can be calculated with the help of the 
formula 

If the parameter is not specified by any convention, it is defined only up to a real 
multiplier and so is the generator. 

Let us define a one-parameter subgroup by U(Aa). Its generator can be written 
in the form (l/i/?,)(aiPi + a 2 P 2 + a 3 P 3 ), where Pfc are three s.a. operators. Similarly, 
the generator of the subgroup U(n, A) has the form {l/iK)(n\S\ + ri2^2 + W3J3), where 
Jfc are s.a. operators. 
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Definition 5 The three s.a. operators are three components, with respect of the 
coordinates X\,X2,x%, of the total momentum and similarly J*, are components of 
total angular momentum of S. 

The commutators of and are determined by the fact that they are generators 
of Ge and that g H- U(g) is a group representation: 

[P fc , Pi] = , [J fc , Jj] = ihe U jSj , [J fc , Pj] = ihe k ijPj . (1.19) 

Most quantum systems admit position as one of their observables. All massive 
particles and all systems composite of a fixed number of massive particles, their 
position being their centre of mass, do. Given an inertial frame (x,t), one can 
measure the corresponding coordinates of the quantum system. We assume: 

Rule 7 There are three s.a. operators Qk such that the average position of object S 
in state T is tr[TQk] and 

U(3)Q fc U(3)t = Q k -a k l 

and 

U(n, 6)Q k V(n, 0)t = ^ 0(fi, -0) W Q, . 

i 

Hence, the position operators are transformed by Euclidean group in agreement 
with Eq. (11.131) and the corresponding operators U(g) transform according to Eq. 
(|1.18p . From these transformation properties, the commutation relations of position 
Qk with total momentum and total angular momentum J k can be derived, 

[Q k , Pi] = ih8 kl l , [J fc , Qt] = ihtkijQj . (1.20) 

Commutation rules (11.191) and ( 11.201) determine the action of operators and 
Pfc on H T uniquely. Consider the set D of rapidly decreasing C°° functions £), 
where £ represents a set of parameters dependent on the system type r. For example, 
for a composite system, it can represent the relative coordinates of the constituents 
with respect to the centre of mass x and for a particle with spin, it can represent 
the components of the spin in some direction, etc. The rapid decrease is required 
with respect to x and those parameters £ that have unbounded ranges of values. In 
fact, D or its generalisation to functions of more than one argument is a common 
invariant domains of all s.a. operators in quantum mechanics and the products of 
these operators can be defined with the help of D. 

Let H r be the completion of D with respect to the inner product 
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where the sum over £ represents sums over all discrete parameters and integrals over 
all continuous ones. The elements of H r are called wave functions. To describe the 
operators, it is sufficient to define the action of their components on C°° functions 
<f>(x, £) because there is always only one s.a. extension. The result is 

Q fe 0(f,O =a*0(f,O (1.21) 

and 

P fc 0(f,O = -^^0(x,O • (1-22) 

For the total angular momentum, the result is more complicated and r-dependent. 
We postpone its description to Chap. 4. 

The form of Hilbert space and the operator described above is called (^-representa- 
tion. All operators on H T in Q-representation can be written in the form of integral 
operators with kernels 

A = A(x, £;£',£') , 

the kernel A(x, £; x', £') being a generalised function of its arguments acting on func- 
tions (/>(x,£) as follows 

(A0)(f, o = J2 1 d ' x ' A ^ ' & f/ ' o . 

where the right-hand side represents the action of the generalised function on a test 
function. 

1.3.2 Time translations 

The next set of transformations are time translations. In quantum mechanics, time 
is, unlike position, just a parameter. This is another part of the asymmetry between 
time and space. For general systems (not necessarily closed), we assume 

Rule 8 Let S be a system of type r that does not take part in any separation status 
change (see Sec. 2.2.4) an d ^ the external fields f be given. Then, time translation 
from t\ to t 2 is represented by unitary operator U(f,t2,ti) on H T satisfying 

U(/,t 3 ,ti) = U(/,t 3 ,t 2 )U(/,t 2 ,t 1 ) 

and we have 

PM,(fa-*o-v = mt2,h)P[^Mf,t 2 ,t^ , 

where g{t 2 — ti) is the group element for Ai = • • • = Ag = and Aio = t 2 — t\. 
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A time translation does not define a unique map on T(H r ) ] f in this case. U(/, t 2 , ^i) 
depends not only on t 2 — ti but also on the position of the system with respect to 
the external fields (even if the fields are stationary). We shall let out the argument 
/ in U(f,t2,ti) in agreement with the current practice. 

Definition 6 The operator H(t) defined by 



is the Hamiltonian of S. 

Operators H(i) for different t do not commute in general. 

The most important observable in non-relativistic quantum mechanics is energy. 
The corresponding operator for system S is its Hamiltonian. Any individually pre- 
pared quantum systems has a definite Hamiltonian with the form 



where function H symbolises the construction of the operator from the other s.a. 
operators of S. The Hamiltonian of system S is an objective property of S. Also, 
the form of Hamiltonian is a model assumption of quantum mechanics. The choice 
of a Hamiltonian is usually the most important step in model construction. 

Hamiltonian H of quantum system S determines the dynamics of S. Dynamics 
has to do with the time aspect of preparation and registration. Any preparation 
procedure finishes at some time instant t p and any registration procedure starts at 
some time instant t r . The times t r and t p can serve as defining the time aspects 
because the whole preparation or registration processes can themselves take some 
time. The dynamics enables to calculate how the probabilities depend on the times 
t r and t p . The dependence can be obtained in two ways. We can make either state 
T(t) to a function of t by shifting T(0) forwards or observable E(£) by shifting it 
backwards by U(i, 0). The probability pj(X,t) corresponding to the time t is then 



The first method is called Schrodinger picture, the second Heisenberg picture. 
Rule 8 implies: 

Proposition 10 Let quantum system S have a Hamiltonian H(i). Then, the dy- 
namical evolution ofS in Schrodinger picture obeys von Neumann- Liouville equation 
of motion 



Hit) =ihU(t,t ) 



rfU(Mo) 
dt 



H = H (operators, external fields) , 



p^X,t)=tr[J(t)E]=tr[JE(t)} . 






(1.24) 
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Proposition 11 Let T be a vector state \<f>)(<f>\. Then 



T(t) = W))W)\ - 



where <f>(t) obeys Schrodinger equation 



ih 



d(f>(t) 
dt 



H(t)cf>(t) 



(1.25) 



and 



0(t) = U(t, 0)0(0) . 



It follows that Liouville-von-Neumann equation for states also determines an evo- 
lution equation for vectors. In particular, vector states remain vector states in 
evolution. More generally, we assume 

Rule 9 Let T be a gemenge $1.8\) with components Then, its time evolution is 
again a gemenge with components Tk(t) 



where T^(t) is given by $1.23\) and Wk are time independent, for each k. 

Of course, Eq. f 1 1 . 2 6 [) is easily obtained from Eq. (jl.8p by multiplying both sides 
by U(t, 0) from the left and by U(t, Oy from the right. The non-trivial assumption 
is that the state remains a gemenge of the described kind. Hence, if an object 
S is prepared in the gemenge state T described by Eq. (11.81) at time and then 
evolved, then the state of each individual object S is objectively always one of the 
components T^it) at time t. 

Let us now restrict ourselves to isolated systems. In this case, the Hamiltonian 
H(i) = H is time independent s.a. operator, the unitary time translation is 



where U(Aio) is the representative of the time translation in the unitary representa- 
tion of the Galilean group on H T and Aio = £2 — ti- Clearly, the Hamiltonian must 
then commute with the total momentum and angular momentum, 




(1.26) 



U(t 2 ,*i) = exp[ 



{i/h)H{t 2 -h)} 



and 



U(t 2 ,ti) 



U(Aio) , 



[H,P fc ] = 0, [H,J fc ] = 0. 



38 



Definition 7 Each unitary transformation U : H r i— > H T that leaves the Hamilto- 
nian H of S invariant, 

UHU f = H 

is called a symmetry of system S. 

All symmetries of a system S form a unitary group which is an objective property of 
S. The generators of its one-parameter subgroups are s.a. operators that commute 
with the Hamiltonian and, as sharp observables, yield probability distributions that 
are independent of time. 

We have ignored the improper transformations such as space inversions and time 
reversal. More about them can be found in Ref. [73J. 
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Chapter 2 

Composition of quantum systems 



This is a key chapter of this Survey. It emphasises the difference between composi- 
tion of systems of the same and of different types. It lists the problems to which the 
standard rules about composition of identical systems lead and it shows a rigorous 
way of removing these difficulties. 

2.1 Composition of systems of different types 

Suppose that two objects of different types are prepared. The objects can be par- 
ticles or composite systems. Then, one can consider this pair of objects as one 
quantum object. This is called composition of objects. First, we describe the math- 
ematical apparatus. 

2.1.1 Tensor product of Hilbert spaces 

The tensor product H = Hi tg> H 2 of H x and H 2 is the Cauchy completion of the 
linear span of the set of products 

(g) 4> : e Hi, ^ e H 2 

with respect to the inner product of H, which is determined by 

(0®^|0'®^> = (0|0')(V#'> • 

The tensor product <fi (g> ip is linear in both arguments. Moreover, if {<f>k} and {ipk} 
are bases of Hi and H 2 , then {4> k <g> -0/} is a basis of H. If the bases are orthonormal 
then any IgH can then be expressed as 

* = ^{<Pk®^)<Pk®^i ■ 
ki 
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If A G L(Hi) and B G L(H 2 ), then their tensor product A <g> B on Hi <g> H 2 is 
determined via the relation 

(A <g> B)(0 <g> ip) = A0 <g> Bi/> 

for all G H x and ip G H 2 . It follows that 

tr[A ® B] = tr[A]tr[B] . 

The tensor product T x (g) T 2 of T 1 G T(H 1 )|" and T 2 G T(H 2 ) ] t ' is determined in 
the same way and it is a trace-class operator with trace 1. However, T(Hi <g> H 2 )+ 
contains more than just tensor products of elements from T(Hi)|" and T(H 2 )+. 
The partial trace over the Hilbert space H 2 , say, is the positive linear mapping 

n 2 :T(H 1 ®H 2 )+^T(H 1 ) + 

defined via the relation 

tr[n 2 (W)A] = tr[W(A <g> 1 2 )] 

for all A G L(Hi), W G T(Hi (g) H 2 ) and 1 2 is the identity operator on H 2 . If 
{4>k} C Hx and {tp k } C H 2 are orthonormal bases, then II 2 (W) can be written as 

n 2 ( w) = (h ®^*|w(^-<g>^ fc ))|&) (fa I . 

ijk 

Here \(f>i){<f)j\ is the bounded linear operator on Hi given by 

for all G Hi. The partial trace over Hi is defined similarly. 

If W = Ti <g> T 2 , then T x = n 2 (W) and T 2 = n^W) but, in general, 

n 2 (W) <g>iii(w) . (2.i) 

In particular, if W = P[\l/], then 

p[*] = n 2 (P[*]) (8> ni(P[*]) 

if and only if 

\J/ = <g) t]) 

for some G Hi and -0 G H 2 . In that case also 

n 2 (P[*]) = P[0] , ni(P[*]) = p^] . 
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Thus, tensor products of extremal elements of the sets T(Hi) ] f and T(H 2 )^ do not 
exhaust the set of extremal elements of T(Hi £§> Hz)t- 

Tensor products of more Hilbert spaces and the corresponding notions and rela- 
tions can be obtained using the above rules. 

Tensor product is also an operation for POV measures. Let Ei : Fi (->■ L r (Hi) 
with dimension n\ and E 2 : F 2 i— > L r (H 2 ) with dimension n 2 be two POV measures 
on Hilbert spaces Hi and H 2 with value sets Oi and f2 2 . Then POV measures 
(Ei ® E 2 ) : (Fi x F 2 ) \-t L r (Hi <8> H 2 ) on the tensor product Hi Cg> H 2 has dimension 
ni + n 2 and values set 17 1 x f2 2 and is defined by 



for all X\ C W 11 and X 2 C W 12 . Tensor product of POV measures is associative but 
not commutative. 

Let us now turn to physical interpretation. 

Rule 10 Let S\ and <S 2 be two quantum objects of different types and their Hilbert 
spaces be Hi and H 2; respectively. Then, the object S composed of S\ and 5 2 has 
the Hilbert space H = Hi g) H 2 , its states are elements of T(Hi ® H 2 )^ and its 
effects are elements o/L r (Hi <g> H 2 )< 1 

Rule 11 Let T be a state of a composite object S + S' . The necessary and sufficient 
condition for the gemenge structure of the partial trace over S' to be 



where T' k are some states of S' . 

One can say that gemenge structures are invariant with respect to compositions. 
By registration of observables of S alone, only the state operator T defined by Eq. 
(11. 8p can be determined. However, by registration of observables pertaining to a 
composite object S + S', some information about the gemenge structure of T can 
be obtained from Rule 11. Suppose, e.g., that S + S' is in a vector state P[^]. 
Then partial trace n'(P[\l/]) cannot be a gemenge. This fact is at the root of the 
objectification problem in quantum theory of measurement (cf. [16], our Sees. 6.1 
and 6.3). 



(Ei ® E 2 )(Vi x X 2 ) = Ei(Xi) <g> E 2 (V 2 ) 




is that T itself has the gemenge structure 




k > 
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If the state of the composite object 5i + 5 2 is W 6 T(Hi <8> H 2 )^, then the 
two partial trace operations determine unique states, n 2 (W) of Si and II^W) of 
<S 2 . These states are called reduced states. The fact that the reduced state n 2 (W) 
is uniquely determined by W means, in particular, that objects Si and <S 2 can be 
identified as subsystems of Si + S 2 . 

Based on the mathematical properties of the tensor products, the description of 
objects composed of arbitrary number of sub-systems can be obtained by extension 
of the above methods. The only condition is that no two subsystems belong to the 
same type. 

2.1.2 Entanglement 

Definition 8 If object S composite of two quantum objects Si and 5 2 is in a state 
W that satisfies condition 112. 1\) one says that Si and S 2 are entangled or that state 
W is entangled. 

Entanglement is a physical phenomenon that has measurable consequences. One 
of these consequences are correlations between outcomes of registrations that are 
performed jointly on the two (or more) entangled systems. 

As an example, consider two objects Si and <S> 2 and two sharp observables Aj : 
Hj h> Hj, % — 1, 2. Let \di) G Hj and \bi) G Hj be four eigenstates, 

ki\a,i) = ai\ai) , ki\bi) = bi\bi) , 

and bi > Oj, i — 1, 2. The state P[\l/] of the composite object Si + 5 2 , where 

= -/=(h> ® Ik) + \h) ® |o 2 )) 

is entangled. Indeed, 



and P[^] is not a tensor product of these two states. 

Let us calculate the correlations of two sharp observables, Ai (g> 1 and 1 <8> A 2 on 
Hi <S> H 2 . The observables commute hence we can measure them jointly (see Sec. 
3.2). First, we need: 

Definition 9 Let IK be a s.a. operator with a dense invariant domain and T be an 
arbitrary state. Then 





(2-3) 



is called variance of A in T. 
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Second, 

Definition 10 Normalised correlation of any two observables A and B in a state T 
is defined by 

c(A , B , T)= MTABl- tr[ TA]MTBl 
v ; AAAB v y 

The normalised correlation satisfies 

-1 < C(A, B, T) < 1 

for all A, B and T because of Schwarz' inequality. The proof is based on the facts 
that Retr [TAB] is a positive symmetric bilinear form on real linear space L r (Hi <8> 
H 2 ), Retr [TAB] = tr [TAB] if A and B commute and their correlation can only be 
measured if they commute. If C(A, B, T) = 1, the operators are strongly correlated, 
if C(A, B,T) = 0, they are uncorrelated and if C(A, B, T) = —1, they are strongly 
anticorrelated (see [16], p. 50). 

First, we calculate their first two moments of observables Ai ® 1 and 1 <S> A 2 in 
state P[tf]: 

(tf |Ai ® = i(ai + 6i) , ® A 2 |*) = i(a 2 + b 2 ) 

and 

(*|(A X ® l) 2 |v&) = -(a? + 6?) , <*|(1 ® A 2 ) 2 |v&> = -(a* + 6g) . 
The variances are 

A(Ai <g> 1) = (^-^ J > A(1®A^ 2 
The average of the product (Aj <g) 1)(1 <g> A 2 ) = (Ai g) A 2 ) is 



\ 2 / \ 2 

&i — ai \ a / -t a \ (b 2 -a 2 



($|Ai ® A 2 $) 



ai& 2 + 6ia 2 



2 

so that, finally 

C(A! ® 1, 1 ® A 2 , P[*]) = -1 . 

The result is that the observables are strongly anticorrelated in state P[^]. What this 
means, can be seen from the probability distributions for different possible outcomes 
by measuring the two observables. The corresponding PV measures define the four 
projections 

P aa = Ei({ ai }) ® E 2 ({a 2 }) , P ab = Ei({ ai }) ® E 2 ({6 2 }) , 
P 6a = Ei({6i}) ® E 2 ({a 2 }) , P bb = Exdh}) ® E 2 ({6 2 }) , 
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and we obtain 

(*|P aa |*) = 0, (*|P a6 |*) = -, 

(*|P 6a |*) = i , (*|P 66 |#) = o. 

It follows: if the registration of Ai <8> 1 gives a± (bi) then the registration of 1 ® Ai 
gives 62 (ct2) w ith certainty, and vice versa, the correlation being symmetric with 
respect to Si and S 2 . 

Suppose now that Si and S 2 are far from each other, Si near point X\ and S 2 near 
point x 2 . Could one use the strong anticorrelation to send signals from x\ to x 2 , 
say? There are two reasons why one cannot. First, one has no choice of what value, 
ai or &i one obtains at x\. As is easily seen from the state n 2 (P[^ r ]) of «Si, Eq. ( 12. 2p . 
the probability of each outcome is 1/2. One has, therefore, no control about what 
signal will be sent. Second, suppose that the state P[\l/] is prepared many times and 
let, in the first chain of experiments, the observer at x\ registers Ai every time, in 
the second chain, he does nothing. Is there any difference between the two cases 
that could be recognized at x 2 l In the first case, the state of S 2 is given by Eq. (12. 2p 
including its gemenge structure, whereas, in the second case, the state operator is 
the same but it is not a gemenge because the composite system is in a vector state. 
However, by registration of any observable pertinent to S 2 , the observer at x 2 cannot 
distinguish different gemenge structures of the same state operator from each other. 

The next interesting question is, how the influence of a registered value at x\ on 
that at x 2 is to be understood. Even in classical mechanics, one can arrange strong 
correlations. For example, if a body with zero angular momentum with respect to 
its centre of mass decays into two bodies flying away from each other, the angular 
momentum of the first is exactly the opposite to that of the second. This strong 
anticorrelation cannot be used to send signals, either. It is moreover clear, that a 
measurement of the first angular momentum giving the value Li is not a cause of the 
second angular momentum having the value —L\. Rather, the decay is the common 
cause of the two values being opposite. The process of creating the two opposite 
values at distant points is also completely local: the decay is a local phenomenon and 
the movement of each of the debris is governed by a local equation of motion. And, 
the values Li and —L\ are objective, that is, they exist on the debris independently 
of any measurement and can be, in this way, transported from the decay point to 
the measurements in a local way. 

In quantum mechanics, such an explanation of the correlations is not possible. 
A value of an observable is created only during its registration. It does not exist 
in any form before the registration, except in the special case of state which is an 
eigenstate. However, in our example, ^ is not an eigenstate either of Ai ® 1 or of 
1 <g> A 2 . The registrations performed simultaneously at x\ and at x 2 , which can be 
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very far from each other, are connected by a relation that is utterly non-local. How 
can the apparatuses together with parts of the quantum system at two distant points 
x\ and at 'know' what values they are to create in order that the correlations 
result? Note that the rejection of objectivity of observables leads more directly to 
non-locality than assumption of some sort of realism. 

The nature of simultaneity with which the correlations take place can be described 
in more details, see Ref. [37] . 




The non-local correlation between the registration outcomes can be ascertained 
only if both values are known and can be compared. It may therefore be more precise 
if we say that the non-local correlations can only be seen if the non-local observable 
Ai <g> A 2 is registered. This non-locality of quantum correlations in entangled states 
does not lead to any internal contradictions in quantum mechanics and is compatible 
with other successful theories (such as special relativity) as well as with existing 
experimental data. Nonetheless it is very surprising and it has been very thoroughly 
studied. In this way, various conditions (e.g., Bell inequality) have been found that 
had to be satisfied by values of observables if they were objective and locality were 
satisfied. Experiments show that such conditions are violated, and, moreover, that 
their violation can even be exploited in quantum communication techniques |14j . 

Finally, let us repeat here that non-objectivity of observables is accepted by our 
interpretation of quantum mechanics in the full extent and that this does not lead 
to any contradiction with Basic Ontological Hypothesis of Quantum Mechanics and 
with our realist interpretation. 

2.2 Composition of identical systems 

If a prepared object has more than one subsystem of the same type (identical subsys- 
tems), then these subsystems are indistinguishable. This idea can be mathematically 
expressed as invariance with respect to permutations. 

2.2.1 Identical subsystems 

Let Sat be the permutation group of iV objects, that is, each element g of is a 
bijective map g : {1, • • • , N} i— > {1, • • • , N}, the inverse element to g is the inverse 
map g~ x and the group product of g\ and #2 is defined by {g\g2){k) = gi(g2(k)), 



Given a Hilbert space H, let us denote by H the tensor product of iV copies of 




k G {!,■■■, 



N}. 



H, 



H 



■iV 



H <g> H <g> • • • <g> H . 



46 



On H N , the permutation group Sn acts as follows. Let if) k ell, k = I, - • ,N, then 
and 

gOl g> • • • ® ^v) = V's(l) ® • • • ® ^s(JV) • (2-5) 

g preserves the inner product of and is, therefore, bounded and continuous. 
Hence, it can be extended by linearity and continuity to the whole of H N . The 
resulting operator on is denoted by the same symbol g and is a unitary operator 
by construction. The action (12.51) thus defines a unitary representation of the group 
S N on H*. 

All vectors of that transform according to a fixed unitary representation 
7Z of Sat form a closed linear subspace of H w that will be denoted by H^. The 
representations being unitary, the subspaces are orthogonal to each other. Let 
us denote by the orthogonal projection operator, 

pg^H^Hg. 

The location order of a given state in a tensor product can be considered as an 
information about the identity of the corresponding system. Such information has 
no physical meaning and a change of the ordering is just a kind of gauge transforma- 
tiorfl. Motivated by this idea, we look for one-dimensional unitary representations 
of because only these transform vectors by a phase factor multiplication. Sat has 
exactly two one-dimensional unitary representations: the symmetric (trivial) one, 
g i — y 1 , and the alternating one g !->■ r](g)l for each g £ Sjv, where r)(g) = 1 for 
even permutations g and rj(g) = —1 for odd permutations g. If 1Z is the symmetric 
(alternating) representation we use symbol (H^) for H^. Let P{ N ^ (P£ ) be 
the orthogonal projection on (H^). Note that the usual operation of symmetri- 
sation or antisymmetrisation on a vector \1/ £ H N , such as 

i) <g> ^ (i/2)(V> ® ± ® VO 

in H 2 , is nothing but Ps \& or P a ty, respectively. 

Now, we are ready to formulate the basic assumption concerning identical subsys- 
tems. From relativistic quantum field theory [72], we take over the following result. 



Rule 12 Let S be a composite quantum object consisting of N subsystems S, each 
of type t with Hilbert space H r . Then, with important exceptions to be specified 

1 A different and independent part (ignored here) of the theory of identical particles is that 
states of two identical systems can also be swapped in a physical process of continuous evolution, 
and can so entail a non-trivial phase factor at the total state (anyons, see, e.g. Ref. [71]). 
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later, the Hilbert space of S N is H^, for subsystems with integer spin and for 
those with half-integer spin. 

Definition 11 Systems with integer spin are called bosons and those with half- 
integer spin are called fermions. The symmetry properties of states leads to Bose- 
Einstein statistics for bosons and Fermi-Dirac one for fermions. 

We can, therefore, introduce a useful notation, a common symbol for the 

subspaces of the symmetric and anti-symmetric representations and for the 
corresponding projections because the representation is determined by the system 
type r. 

Proposition 12 The states of any system S N consisting of subsystems of type r for 
which Rule 12 holds are elements o/T(H^/n)+ and the effects of S N are elements 



Proposition 12 follows directly from the definition of a system with a given Hilbert 
space. To see its significance, let us make a comparison with the case of subsystems 
of different types. First, one has to realise the obvious fact that observing any- 
thing on a subsystem is tantamount to observing something on the whole system. 
The application of this rule to system S' composite of two subsystems S\ and S2 
of different types has the following mathematical expression. Any effect Ei of Si 
determines uniquely the effect Ei ® I2 of S' . Thus, even if <Si is a part of S', we can 
measure Ei on it. Similarly, if state P[<f>i] of <Si and state P[02] of S2 are prepared 
independently, then the state of the composed system is P[0i] ® P[<fo] and the partial 
traces give back the originally prepared states of the subsystems. Thus, composing 
such systems does not disturb their individuality and rules valid for each of them 
separately. 

Looking at the composition of identical systems we observe a very different pic- 
ture. For instance, consider system S 2 of two bosons of type r with common Hilbert 
space H T . The Hilbert space of S 2 is H^ T y Let {ipk} be an orthonormal basis of 
H T . Then, the set of vectors 



is an orthonormal basis for H^Ln. Hence, if Preparation I prepares a boson in state 
P[<f>k] and independent Preparation II prepares another boson in P[4>i\, k ^ I, then 
neither boson seems to be in its individually prepared state. Clearly, the notion of 
individual states as partial traces does not make sense for particles of the same type. 
Moreover, the notion of preparation itself becomes problematical. 




■ip k <g> ip k , Vk 



and 




(2.6) 
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Next, let A be any operator on H T . Then, neither A <g> 1 nor 1 £g> A are operators 
on H^ T ) because they map any element of the above basis out of H^L^. Thus, 
they are not observables of any of the bosons. As H r is the Hilbert space of each 
boson constituent of S 2 , the effects of each boson would have to be operators on H T 
according to Rule 2. 

It seems therefore that unrestricted validity of Proposition 12 would contradict 
Rules of Sees. 1.1.2 and 1.2.2 on prepared states and form of registrable observ- 
ables. Clearly all this can be generalised to any number of both boson and fermion 
constituents. Let us study this problem in more detail. 

2.2.2 Cluster separability 

The problem described in the previous section is aggravated if the two systems 
are prepared in different laboratories. It seems that experiments with one object 
might be disturbed by another object of the same type, even if it were prepared 
independently, far away from the first. One can avoid similar problems by adding 
some assumption of locality to the axioms of quantum theory. 

The relativistic theory starts with the requirement that space-time symmetries of 
an isolated system (i.e., that is alone in space) be realised by unitary representations 
of Poincare group on the Hilbert space of states, see Refs. [72] and [28J . Then, the 
cluster decomposition principle, a locality assumption, states that if multi-particle 
scattering experiments are studied in distant laboratories, then the S'-matrix element 
for the overall process factorizes into those concerning only the experiments in the 
single laboratories. This ensures a factorisation of the corresponding transition 
probabilities, so that an experiment in one laboratory cannot influence the results 
obtained in another one. Cluster decomposition principle implies non-trivial local 
properties of the theory underlying the S'-matrix, in particular it plays a crucial part 
in making local field theory inevitable (cf. Ref. [72], Chap. 4). 

In the phenomenological theory of relativistic or non-relativistic many-body sys- 
tems, Hilbert space of an isolated system must also carry a unitary representation 
of Poincare or Galilei group. Then, the so-called cluster separability is a locality 
assumption, see, e.g., Refs. [38] or [18] and references therein. It is a condition on 
interaction terms in the generators of the space-time symmetry group saying: if the 
system is separated into disjoint subsystems (=clusters) by a sufficiently large space- 
like separation, then each subsystem behaves as an isolated system with a suitable 
representation of space-time symmetries on its Hilbert space, see Ref. [SS], Sec. 6.1. 
Let's call this principle cluster separability I. 

Another special case of locality assumption has been described by Peres, Ref. 
[56] . p. 128. Let us reformulate it as follows 
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Cluster Separability II No quantum experiment with a system in a local labora- 
tory is affected by the mere presence of an identical system in remote parts of the 
universe. 

It is well known (see, e.g., Ref. [5B], p. 136) that this principle leads to restrictions 
on possible statistics (fermions, bosons). What is less well known is that it also 
motivates non-trivial locality conditions on observables. 
The locality condition is formulated in Ref. [56], p. 128: 

... a state w is called remote if || Aw|| is vanishingly small, for any operator 
A which corresponds to a quantum test in a nearby location. ... We can 
now show that the entanglement of a local quantum system with another 
system in a remote state (as defined above) has no observable effect. 

This is a condition on A inasmuch as there has to be at least one remote state for 
A. 

However, Peres does not warn that the standard operators of quantum mechanics, 
which are in fact generators of space-time symmetries, do not satisfy his condition 
on A [31]. Similarly, basic observables of relativistic-field or many-body theories 
are generators of Poincare or Galilei groups and so they do not satisfy the locality 
condition, either. It follows that cluster separability II is logically independent from 
the cluster decomposition or of cluster separability I. Of course, this does not mean 
that the basic observables are to be rejected. They are very useful if the assumption 
of isolated system is a good approximation. However, this assumption is definitely 
a bad one for quantum theory of measurement. 

The present section reformulates and extends Peres' ideas. Let us explain every- 
thing, working in Q- representation of Hilbert space H T and of operators on it. Then, 
one can also write tensor products as ordinary products and indicate the order of 
factors by indices at system coordinates. 

Consider two experiments. In Experiment I, vector statd§ tp(xi) of particle S\ of 
type t is prepared in our laboratory. In Experiment II, state ip(xi) of particle Si is 
prepared as in Experiment I and vector state 0(x 2 ) of system <S 2 of the same type 
is prepared simultaneously in a remote laboratory. Then, according to Proposition 
12, the state of the composite system S must be 

*(fi,f 2 ) = ^(^(xi)0(f 2 ) + e(r)0(fi)^(x 2 )) , (2.7) 

where v = [2(1 + e| c| 2 )] " x / 2 is a normalisation factor, c = {ip\4>) while e(r) is 1 for 
bosons and -1 for fermions. 

2 We let out the variable £ because it will not play any role in the subsequent discussion. 
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Now, our lab claims that the state of Si is if)(xi) because this state has been 
prepared in the lab according to all rules of experimental art. However, somebody 
who knows what has happened in both labs claims that the preparation has been a 
combination of two sub-preparations and that the state is (12.7ft . Hence, the notion 
of preparation and state becomes ambiguous, at least if Rule 12 holds without any 
restrictions. This is true even if states if) and <f> are localized (the wave functions 
have supports) within the respective laboratories. 

Localisation of the states if) and <fi removes at least the following difficulty Sup- 
pose that a fermion is prepared in a distant laboratory in a state <fi. Then a fermion 
of the same type cannot be prepared in the same state if) = <f> in our laboratory: the 
composite state of the two fermions had then to be zero (Pauli exclusion principle). 
However, if the wave function of the fermion prepared in our laboratory falls off 
rapidly outside our laboratory and that in the remote laboratory does the same out- 
side the remote one, then the two wave functions would be different, if>(x) 7^ 4>{x), 
and their antisymmetric combination would not be zero even if if) was just a Eu- 
clidean group transform of 0. The requirement of the fall-off is very plausible indeed. 
It would be technically impossible to prepare a state with the same wave function if> 
in a laboratory located in Prague, and simultaneously in a laboratory in Bern, say, 
even for bosons. 

Let us return to the ambiguity of preparations. It would not be really harmful 
if it had no measurable consequences. But it apparently has! Suppose that the 
registration on Si corresponding to an observable a with kernel ) of Si is 

performed in our laboratory after Experiment II. It is then equally possible that 
the registration is made on Si or S2 and both can make a contribution to the 
outcome. Hence, the correct observable A that is associated with such a registration 
is described by kernel 

A(x%, x 2 ; x^, x 1 ^) = a(xi] x' 1 )5(x 2 — x' 2 ) + a(x 2 ; x' 2 )5(xi — x[) , (2.8) 

which is an operator on H^L^ so that it satisfies Proposition 12. 

The average of a in the state prepared in Experiment I is (^a^). On the other 
hand, Experiment II leads to the average: 

(jj\aiP) + (0|a0) + e(r)c(0|a^) + e(r)c* (V>|a0) 
l + e(r)|c| 2 

The two averages are different and the difference need not be small. For example, 
let the supports of the wave functions if) and be disjoint, each inside an open set 
around the corresponding laboratory. Then, c = 0. Consider the position operator 
of Si. The kernel is a(a ? i;x' 1 ) = xid(xi — x[), and the difference between the two 
averages becomes 
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This can be made arbitrary large by shifting the remote laboratory sufficiently far 
away. 

Thus, cluster separability is violated. However, the position does not satisfy 
Peres' condition: such an 'observable' controls position of the system in the whole 
infinite space. By that, it is utterly different from observables that can be registered 
in a human laboratory. It can analogously be shown that all standard operators of 
quantum mechanics are similar to position in this property. 

2.2.3 D-local states and observables 

This section is a more detailed and accurate account of the conditions on states and 
observables than that given by Peres. They will be assumed as at least approximately 
satisfied. 

We shall consider general iV-particle systems and only those operators A that fulfil 
the following condition. The kernel A(x±, • • • , xn', x[, ■ • • , x' N ) of A is a distribution 
on Schwartz space (its elements are rapidly decreasing C°° functions, see, e.g., Ref. 
|63j) S(IR 3Ar ) in variables x[, - ■ ■ ,x' N for any fixed value of x x , • • • , xjy such that 

g(xi, • • • , x N ) = J d 3N x'A(xi, • • • , x N ; x 1 ^ • ■ ■ , f^)/^, • • • , x N ) E S(R 3N ) 

for any f(x%, ■ ■ ■ ,%)■ This is usually satisfied, see Sec. 1.3.1. 

Definition 12 Let A{x\, ■ ■ ■ ,xn',x[,--- , x' N ) and B(xi, ■ ■ ■ , xn] x[, ■ ■ ■ , x' N ) be two 
kernels. Then, for any K = 0, 1, • • • ,N, 

J d X ki ' ' ' X k K ^(-^1; ' " " i X N\ x li ' ' ' i %n)-B(Xi, ' ' " > %n'i X l ' ' ' ' ' X n) (^-9) 

is a distribution acting on test functions of S(R 6N ~ 3K ) that maps test function 
f(x[, • • • , x' N , x'(, • • • , x" N )^' on test function g(x±, • • • , xn, x[, ■ ■ • , x' N ) A ' , where sym- 
bol (■ ■ • ) A/ means that K variables x* kv ■ • • , x' kR are left out from N primed variables 
x'±, • • • , x' N . Distribution Ii2.9\) is called convolution A*B over variables 

Let our laboratory takes an open set D C M 3 and similarly D' for the remote 
one. The change of S\ state due to some actions in a remote laboratory would not 
be measurable in our laboratory if the states and observables are D-local in the 
following sense: 

Definition 13 Let D C 1R 3 be open, let A be an operator of S and let the following 
conditions hold: 

1. A{x ir -- , xn'i x'i, ■ ■ ■ , x' N ) is the zero distribution for any x±, ■ ■ ■ , x^ G R 3 ^ \ 
D N . 
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2. 

J <i 3 x A(x±, • • • , xn] x 1 , • • • , x N )f{x 1 , • ■ ■ , x N ) = 
for any test function f such that 

suppfix',,-.. ,x' N )GR 3N \D N . 

Then A is called D -local. 

Let A be D-local for some open set D, let D' be open and D C Then, A is 
//-local. Let A be D-local and also D'-local for two open sets D and D'. Then, A 
is (D fl D')-\oc&\. Thus, all open sets D such that A is D-local form a filter in the 
Boolean lattice of open subsets of M 3 . 

Example 1 N — 1, x') = x 1 5(x — x'). A is M 3 -local and the filter is M 3 . 
Example 2 N = 1, -^(x) is a wave function with support D. Then \ip(x)){ip(x)\ is 
.D'-local, where D' is the interior of D, and the filter is the family of all open sets 
containing D' . 

Definition 14 A state with a D-local state operator T is called D-local. An observ- 
able E(X) ; all effects E(X) ; IgF, of which are D-local, is called D-local. 

The key property is the following: 

Proposition 13 Let system S\ of type T\ composite of Ni particles (not necessarily 
identical) be prepared in state Ti that is D\-local. Let system Si of type r 2 composite 
of N 2 particles be prepared in state T 2 that is D 2 -local, where D 1 fl D 2 = 0. Then, 
the state of Si + S 2 according to Rule 12 is 

P(T!®T 2 )P , 

where P is the symmetrisation over all identical bosonic and the anti-symmetrisation 
over all identical fermionic particles contained in S± + S 2 followed by multiplication 
by a normalisation factor. Furthermore: 

1. Let T\ = r 2 (N 1 = N 2 ) and let E(X) be a D-local observable for Si as well as 
for S 2 . Then, 

tr[(E(X) ® 1 + 1 <g> E(X))P(Ti ® T 2 )P] = *r[EpQTi] . (2.10) 

2. Let T\ 7^ r 2 and let E(X) be a D-local observable for Si. Then 

tr[(E(X) ® 1)P(T! ® T 2 )P] = tr[E(X)Ti] . (2.11) 
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Thus, the existence of identical systems somewhere else does not influence the reg- 
istrations of .D-local observables in our laboratory. 



Proof First, we formulate an immediate consequence of the definition of D-locality 
and of convolution. Let A be .D-local and B D'-local, where D D D' — 0. Then 
convolution A * B over any variables x^ , ■ ■ ■ , Xk K for K ^ is the zero distribution 
on S(R eN - 3K ). 

Let us first prove Part 1 of the proposition. Then, the kernel of E(X)®l+l£g>E(X) 

is 

E(X)(x ir -- } x N ;x' 1} --- ,x N )5(yi - y\) ■ ■ ■ 5(y N - y 1 ^ 

+ 5(x x - x[) ■ --S^n - & N )E(x)(y u ■■■ ,y N ;y[,--- ,$v) 

and that of Ti <g> T 2 is 

Ti(x ir -- } x N ;x" } --- } x'x)T 2 (g 1} --- ••• . (2.12) 

Operator P from the left acts in three steps. First, it makes permutations of all 
variables inside each group of variables x\, ■ ■ ■ ,x' N ,y[, ■ ■ ■ ,]f N that correspond to 
one type of particles in expression f)2.12p . Second, it sums with the correct sign 
over all such permutations that result in different terms. Finally, it multiplies the 
resulting sum by a normalisation factor. Operator P from the right acts similarly 
on variables x", ■■■ ,x'^,y'{,--- , y^. 

Thus, the trace on the left-hand side of Eq. (I2.10p is a sum of traces of all terms 
obtained in this way. Let one such term be 

J d 3 x[ . . . d 3 x' N d 3 y[ . . . d 3 y' N E(X)(x; x')S(y; y')^ )T 2 (- ••;••■) , 

where we have indicated variables x[, ■ ■ • , x' N by x', etc., and '• • • ' in arguments of Ti 
or T 2 stands for N variables from x[, ■ ■ ■ , x' N , y[, ■ ■ ■ , i7 N or from x'{, • • • , x" Nl ]?{,•■• , $v 
obtained by the corresponding permutations. 

If the permutations only change the order of x's or of y's, then all such terms are 
equal and the value of the trace is 




. . . d 3 x' N d 3 y[ . . . d 3 y' N E(X)(x; x')S(y, y'^x'; x")T 2 (y>; y") 



= tr[E{X)T 1 ] (2.13) 

If, however, the permutations mix K x's with K y's, then the 6iV-variables con- 
volution contains a /^-variables convolution E(X) * T 2 , which is zero. Hence, the 
contribution of all such terms is just ir[E(X)Ti]. 



54 



Next, consider the terms of the form 

d 3 x[ . . . d 3 x' N d 3 y[ . . . d 3 y' N 5(x, x')E(X)(y : y')^ )T 2 (- ••;•••) 



Analogously, only those permutations can contribute to the convolution that ex- 
change all x's with all y's so that E(X)(y : y 1 ) is convoluted with Ti(y';y"). This 
is possible because both systems are of the same type and so there is exactly one 
group of i/'s to each groups of x's and these two groups contain the same number of 
variables. Again, all such permutations give the same result and the total trace is, 
therefore, 2tr[E(X)T 1 \. 

The normalisation factor is 

(tr[P{T 1 ®l 2 )P}Y 1 

and it is clear that only those permutations contribute to the trace which do not 
lead to convolutions between Ti and T 2 . There are only two kinds of them: those 
mixing x's among themselves and y's among themselves, or those exchanging all x's 
with all y's. In each of the two cases, the trace is unity, hence the normalisation 
factor is 1/2, which implies Eq. (I2.10p . The proof of Part 2 is similar but simpler, 
Q.E.D. 

For example, if is D- local, \<fi)(4>\ is D'-local and the D-local kernel a£>(xi,af' 1 ) 

is used instead of a(x 1 ;x' 1 ) in formula (I2.8p defining operator instead of A we 
obtain 

' d 3 X\ / d 3 x[ / d 3 x 2 / d 3 x' 2 ^*{xi,X2)A D {xi,x\]X2 1 x' 2 )^{x\,x' 2 ) 

D JD JD JD 

POD 

d 3 x\ I d 3 x' 1 ip*(xi)aD(xi;x' 1 )ip(x' 1 ) 



as if no «S 2 existed. Thus, in this case, cluster separability holds. 

However, the ambiguity of the state of S\ in Experiment II still remains: the 
state prepared in our laboratory is \ip)(ip\ and, at the time that S2 is prepared, <Si 
loses its individual states if Rule 12 is valid without exceptions. This is clearly at 
variance with cluster separability; we shall remove this difficulty in Sec. 2.2.4. 

Next, we shall prove that each observable E of system S of type r is associated, 
for a reasonable open set D, with a unique Z)-local observable Ad(E), the so-called 
/^-localisation of E, such that the condition 

tr[E(X)J] = tr[A D (E(X))J] 

is satisfied for all X 6 F and for all Z)-local states T. Thus, any standard observable 
E such as position, momentum, etc. can be registered in D by registering A^E) 
and the probability distribution would be the same as if we had registered E. 
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Let us denote by H T (D) the Hilbert space obtained by completion of the linear 
space of C°°-functions with support in D N with respect to the inner product of H T . 
H T (D) is a closed linear subspace of H r . Let P D be the orthogonal projection from 
H r onto H T (D). 

Definition 15 Let 

A D : L r (H r ) ^ L r (H T (D)) 

be defined by 

A D (A) = P D AP D . 
Mapping Ad is called D -localisation. 

Clearly, D-localisation of any operator in L r (H r ) is Hermitean, bounded and posi- 
tive, hence 

A D : L r (H r )< 1 i->- L r (H r )< 1 . 

Of course, the D-localisation is not a unitary map. For example, it does not preserve 
operator norm, but it holds 

I|Ad(A)|| < ||A||. 

The operators and their D-localisations are considered as acting on H T . D-local 
operators leave H T (D) invariant and define, therefore, also operators on Hilbert 
space H T (D). 

Second, we can use these facts for a construction of D-local POV measure on 
H T (D) from any observable E on H T by D-localising the effects E(X). The normal- 
isation condition becomes 

A D (E(R")) = P D 1P D = 1 D , 

where n is the dimension of E (see Sec. 1.2.1). Hence, D-localisation of an observable 
on H r need not be an observable on H r but it is always one on H T (D). Also, D- 
localisation of a projection will not be a projection in general and so a D-localisation 
of a PV measure need not be a PV measure. Let us call this construction D- 
localisation of POV measures. All D-local POV measures commute with spectral 
projections of position PV measures, E^(X), if X fl D — 0, where E^(X) are sharp 
position observables that are well defined for S. For example, if S is composite of 
N particles of different type, then k — 1, • • • ,N and E^(X) is a spectral projection 
of the position of the k-th particle. Thus, the restriction to D-local observables may 
be formally understood in terms of superselection rules (see Sec. 3.3). 

If the map A D is involved in construction of D-local states, it must be followed 
by a suitable normalisation. A D-local state on H r is a state on H T (D) and vice 
versa. 
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2.2.4 Separation status 

With the help of .D-local states and observables, we could remove some contradic- 
tions due to unrestricted validity of Rule 12. The ambiguity of states, however, 
remained. This is a great and ubiquitous problem. Indeed, any microsystem S is 
always a subsystem of a huge family of identical subsystems that exist somewhere 
in the world. 

If we could write down the state of this huge family in accordance with unre- 
stricted Rule 12, then the results of any registrations in our laboratory would be 
well defined by it. However, such a state is not and cannot be known. Luckily, the 
results of registration of .D-local observables do not depend on the unknown parts 
of the state because of Proposition 13. This is the reason why the validity of Rule 
12 can and must be restricted. 

Clearly, one could ignore the entanglement of a single microsystem S with all mi- 
crosystems of the same type, if S had a non-trivial separation status in the following 
sense: 

Definition 16 Let D C M 3 be an open set and system S be prepared in state T. Let 
the following conditions be satisfied: 

• The probability to register value of observable E(X) in set X be tr[T'E(X)] for 
any D-local observable E(X) and for any state T' of S. 

• There is at least one D-local observable E'(X) such that tr[TE'(X)] ^ for 
some X . 

Then, open set D is called separation status of S. 

The first condition means that the registration of E(X) on T' is not disturbed by any 
identical system in a state different from T'. This includes the following condition 
that must be fulfilled by the apparatus A that registers observable E(X): The states 
of those subsystems of A that are identical with S do not disturb the registration 
of E(X) by A (but may disturb other registrations). A model of A satisfying the 
condition is given in Sec. 6.3. We can then view S as a physical object with a modified 
set of observables. The modification depends on open set D that is determined by 
a preparation of S. This motivates the following 

Rule 13 Effects of S that can be registered must lie inside subsets L r (H r (D))< 1 C 
L r (H T )< l7 where D is the separation status of S. 

Thus, strictly speaking, no quantum system is proper (see Sec. 1.2.2). 

For example, a microsystem that is alone in the Universe has separation status 
D = M?. This is a form of the assumption that a system is isolated. Observables 
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of such a system are the standard ones. The same microsystem in a open set D 
which is surrounded by matter containing a lot of microsystems of the same type 
such that supports of their states do not intersect D has separation status D and 
its observables are the Z)-local ones. The trivial case of separation status for a 
microsystem is D = 0. It has then no observables of its own. 

The composition of systems of the same type is determined by Rule 12 and its 
exceptions that are specified as follows. 

Rule 14 Let two systems, S and S' , have separation statuses D ^ and D' ^ ; 
respectively so that D D D' = 0. Then, the compositions of S and S' follow rules 
valid for composition of systems of different type even if S and S' have in common 
some subsystems of the same type. 

For instance, let the wave function of particle S with separation status D be ip{x) 
and let S' be a family of N particles identical with S. Let S' have separation status 
D' such that D' D D = 0. Its state \l/(xi, ■ ■ ■ ,xn) is symmetric or anti-symmetric 
in its N arguments according to the type. Then the wave function of composite 
system S + S' of N + 1 particles of the same type must be written as 

^(x)*(fi,-.. ,x N ) . (2.14) 

Observe that wave function (12.141) is not (anti-) symmetric in all N + 1 arguments! 
This is at variance with unrestricted validity of Rule 12. According to Rule 12, the 
wave function had to be 

P s Pj. JV + 1 )(^(f)*(f 1 ,... ,x N )) , (2.15) 

where Ps is the projection to the unit sphere of H^^. Now, it also ought to be 
clear why we do not employ Fock-space method to deal with identical systems: it 
automatically (anti-)symmetrises over all systems of the same type. 

Rule 14 is in complete agreement with the common practice in quantum mechan- 
ics. For example, in the theory of the experiment described in Sec. 0.1.2, a state 
is prepared as a state of an individual electron and its entanglement with all other 
electrons, which exist, in fact, everywhere in huge amounts, is serenely ignored. Due 
to Proposition 13, the results of such method would not lead to any contradictions. 

Let us give a mathematical example of separation status change. As for states, let 
it consist of the transformation of state (I2.14p into state (I2.15p . This transformation 
has the form 

Ps o : H ® h. H^ 1 , 

which is a non-invertible and non-linear map between two different Hilbert spaces. 
As for observables, in state (I2.14p . the observables that are registrable on S form 
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the algebra of D-local operators. Indeed, for any an G Ad, there is observable 
s D <g> 1 on S + S' . In state (I2.15p . the algebra would be A0 = 0: there are no 
observables whatsoever that would be registrable individually on S. Thus, the set 
of observables changes from Ad to A0. 

In classical mechanics, the possible states of system S are all positive normalised 
functions (distribution functions) on the phase space P and possible observables are 
all real function on P (at least, all such observables have definite averages on S 
independently of external circumstances). P is uniquely associated with the system 
alone and forms the basis of its kinematic description. Hence, transitions between 
different sets of observables similar to those described above would be impossi- 
ble in classical mechanics. They are only enabled in quantum mechanics by the 
non-objective character of observables: not only their values cannot be ascribed to 
microsystem S alone but some of them are not even registrable in principle due to 
external conditions in which S is. 

We assume that the quantum kinematics of a microsystem is defined mathemati- 
cally by possible states represented by all positive normalised (trace one) operators, 
and possible observables represented by some POV measures, on the Hilbert space 
associated with the system. Then the transitions of states and observables that go 
with changes of separation status cannot be viewed as a part of a dynamical trajec- 
tory due to some new version of the dynamics of S, but as a change of its kinematic 
description. Thus, although the change of separation status is similar to the collapse 
of the wave function (the non-local character included^!) , it is more radical and its 
physical roots are better understood. 

What has been said up to now shows that standard quantum mechanics is in- 
complete in the following sense: 

1 . It accepts and knows only two separation statuses: 

(a) that of isolated systems, D = R 3 , with the standard operators as observ- 
ables, and 

(b) that of a member of a system of identical particles, D — 0, with no 
observables of its own. 

2. It provides no rules for changes of separation status. 

Let us call these theories fixed status quantum mechanics (FSQM). Our main idea is: 
Quantum mechanics must be supplemented by a theory of general separation status 
and by new rules that govern processes in which separation status changes. The 

3 The time instant, when such a non-local change takes place, can in some cases be established 
by observations [37] . 
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new rules must not contradict the rest of quantum mechanics and ought to agree 
with and explain observational facts. 
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Chapter 3 

Further general ideas 



This chapter contains further important general ideas and closes the account of 
the general language of quantum mechanics. It explains the relation between the 
ontological and epistemic aspects of probability notion. It describes some crucial 
properties of quantum observables such as joint measurability and contextuality. 
Finally, it introduces the notion of superselection rules in order to make clear that 
our approaches to classical properties and quantum measurement have noting to do 
with the superselection-rules one. 

3.1 Probability and information 

Quantum mechanics is a statistical theory. Thus, a correct understanding of prob- 
ability and information is an important part in the conceptual framework of the 
theory. The discussion whether probability describes objective properties that can 
be observed in nature or subjective state of the knowledge of some human has 
raged since the invention of probability calculus by Jacob Bernoulli and Pierre- 
Simon Laplace [36J . The origin of the problem might be that the dispute cannot be 
decided: probability has both aspects, ontological and epistemic. 

Probability is a function of a proposition A and its value, p(A), is a measure 
of the degree of certainty that A is true. As a function on a Boolean lattice of 
propositions, it satisfies Cox' axioms [36]. Then it becomes a real additive measure 
on the lattice. Whether a proposition is true or false must be decided by observation, 
at least in principle. Hence, the probability always concerns objective events, at least 
indirectly. 

As an example, consider the state T of a quantum object S that has been prepared 
and a registration of an observable E on the state. The probability Pj{X) concerns 
the proposition that the value of E individually registered on T lies in the set X. The 
value of the probability can be verified by studying the events that happen during 
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such registrations. Thus, the probability concerns both the lack of knowledge of 
what objectively happens and properties of real systems. It is in this sense that 
probability has both epistemic and ontological aspects. 

In quantum mechanics, Pj{X) is not the probability that E possesses a value 
within X in the state T but the probability that a registration outcome will give 
such a value. The only exception is that T is a gemenge of E-eigenstates. A related 
question is whether the outcomes are predictable and we just have not enough infor- 
mation to make the prediction, or objectively unpredictable. Quantum mechanics 
does not deliver the prediction and it would even be incompatible with any 'deeper' 
theory that did (see Sec. 3.2.2). 

Let us describe the general framework that is necessary for any application of 
probability theory. First, there is a system, denote it by S. In the example above, 
there are certain conditions that are imposed on the quantum object S. State T 
is prepared: this is a definite objective condition on S. Observable E is registered: 
another objective condition on S. In general, there must always be an analogous 
set of conditions, let us denote it by C. To each system S subject to condition C 
possibilities in some range are open. These possibilities are described by a set of 
propositions that form a Boolean lattice F. A probability distribution p : F i— > [0, 1] 
is a real additive measure on F. 

If E is a discrete observable, its value set fi is at most countable, fi = {ui, cj 2 • • ■ }■ 
Then the single-element sets {uk}, k — 1, 2, • • • , are atoms of F that generate F and 
the probability distribution Pj(X) can be calculated from pj({uk}) by means of Cox' 
axioms. The atoms are called outcomes. If E is a sharp continuous observable, then 
there are no atoms but continuous observables can be considered as idealisations of 
more realistic POV measures with well-defined atoms. (The reading scale of each 
registration apparatus can be divided into separate finite intervals, the width of 
which corresponds to the accuracy that can really be achieved, etc. see, e.g., Ref. 
[IB], p. 35.) 

If condition C is reproducible or it obtains spontaneously sufficiently often, any- 
body can test the value of the probabilities. The probability distribution p(X) is 
therefore an objective property of condition C on 5. It is so even in cases when the 
outcomes can be predicted uniquely. This would depend on whether condition C 
can be sharpened to other conditions Ck so that 

1. whenever Ck is satisfied, so is C, 

2. each Ck is still recognisable and reproducible and 

3. each outcome allowed by C is uniquely determined by one of C^'s. 

Even in such a case, condition C itself leaves the system a definite amount of free- 
dom that can be described in all detail by the probability distribution p(X) : it 
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is an objective, verifiable property of C alone. And, if we know only that condi- 
tion C obtains and that a probability distribution is its objective property, then 
this probability distribution describes the state of our knowledge, independently of 
whether the conditions Ck do exist and we just do not know which of them obtains 
in each case or not. There is, therefore, no contradiction between the objective and 
subjective aspects of probability. 

Entropy is a certain functional of p(X) that inherits both objective and subjective 
aspects from probability. Discussions similar to those about probability spoil the 
atmosphere about entropy. The existence of a subjective aspect of entropy — the lack 
of information — seduces people to ask confused questions such as: Can a change of 
our knowledge about a system change the system energy? 

The general definition of entropy as a measure of missing information has been 
given by Shannon [66J and its various applications to communication theory are, 
e.g., described in the book by Pierce [57] • Our version is: 

Definition 17 Let p : F i— > [0, 1] be a probability distribution and let Xk be the 
atoms ofF. Then the entropy of p(X) is 



Let us return to the quantum-mechanical example in order to explain which 
information is concerned. After the choice of preparation and registration devices, 
we do not know what will be the outcome of a registration but we just know that 
any outcome Xk of the registration has probability pj(Xk). After an individual 
registration, one particular outcome will be known with certainty. The amount of 
information gained by the registration is the value of S given by Eq. (13. ip . For more 
detail, see Ref. [56J. Thus, the value of S measures the lack of information before 
the registration. 

The entropy and the so-called Maximum Entropy Principle^ (MEP) have become 
important notions of mathematical probability calculus, see, e.g., Ref. (36]. The 
mathematical problem MEP solves can be generally characterised as follows. Let 
system S, condition C and lattice F with atoms Xk be given. Let there be more 
than one set of p(ATj,)'s that is compatible with C. How the probabilities p(Xk) are 
to be assigned so that condition C is properly accounted for without any additional 
bias? Such p(Xk) J s yield the maximum of S as given by ( 13. ip . MEP clearly follows 
from the meaning of entropy as a measure of lack of information. We shall use this 
kind of MEP. 

1 There is also a principle of statistical thermodynamics that carries the same name but ought 
not to be confused with the mathematical MEP. 




(3.1) 



k 
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In quantum mechanics, probability distributions are generally associated with 
both states and observables. This has to do with the non-objective nature of ob- 
servables. Still, can there be a general measure of information lack associated with 
a given state alone? Clearly, such a question is meaningful and the answer is the 
minimum of entropies, each associated with the state and an observable. The mini- 
mum, S(T) is a well-defined function of state T and is called von Neumann entropy. 
One can show (see, e.g., Ref. [56]) that, up to a constant factor, 

S(T) = -tr[Tln(T)]. (3.2) 

As each T must have a discrete spectrum with positive eigenvalues t^, we have 

s(T) = -Y,t*H-tk) ■ 

k 

The lack of information S(T) is associated with the state T of object S and 
consequently with the (classical) condition C that defines the preparation. Hence, 
according to our criterion of objectivity, von Neumann entropy is an objective prop- 
erty of object S prepared in state T. 

MEP together with the von Neumann entropy can be applied to thermodynamics. 
Consider the problem: what is the state T that maximises von Neumann entropy 
under the condition that it has a given average £r[TE] of energy (this is an objective 
property in our approach)? The answer is: the Gibbs' state and the value of La- 
grange multiplier that multiplies the average of energy in the variational principle is 
1//cTb, where T is the temperature and ks Boltzmann constant (for proof see, e.g. 

M- 

Von Neumann entropy of composite objects satisfies a number of interesting in- 
equalities, see, e.g., Ref. [71]. One is the so-called sub-additivity [2]. 

Proposition 14 Let S be composed of objects S\ and 5 2 of different types. Let T 
be the state of S and let 

T 1 = n 2 (T) , T 2 = nx(T) . 

Then 

S(T) < 5(T X ) + S(T 2 ) 
and the equality sign is valid only if 

T = Ti <g) T 2 . 

Thus, von Neumann entropy can be used to measure the amount of entanglement 
in a state of a composite object. 
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3.2 Joint measurability and context uality 



Joint measurability is related to contextuality by the mathematics used to describe 
each. Joint measurability is defined by a map from a Boolean lattice into the lattice 
of effects, and contextuality is related to maps from the sublattice of projections 
into the Boolean lattice {0, 1}. 

3.2.1 Joint measurability 

Let A and B be two s.a. operators on Hilbert space H that are not necessarily 
bounded. Then the product AB as well as the operators AB + BA and AB — BA 
are well defined on their common invariant domain D (see Sec. 1.3.1). The latter 
operator (or its extension) is called commutator of A and B and is denoted by [A, B]. 

Proposition 15 Let A and B be two s.a. operators with a common invariant do- 
main, let i[A, B] have a s.a. extension and let T be an arbitrary state. Then 



Equation (13. 3p is called uncertainty relation. 

The interpretation of the uncertainty relation is as follows. If we prepare many 
copies of an object in state T and register either A or B or i[A, B] on each, then the 
average of [A, B] and variances of A as well as of B will satisfy Eq. (13. 3ft . It is not 
necessary to register all observables jointly. 

An important notion of quantum mechanics is that of joint measurability. Often, 
it is also called simultaneous measurability but the notion has nothing to do with 
time. It simply means that there is one registration device that can measure both 
quantities. 

Definition 18 Two elements Ei and E 2 o/L r (H)< 1 are jointly measurable if there 
is a POV measure E : F \-> L^H)^ such that Ei = E(Xi) and E 2 = E(X 2 ) for some 
Xi and X 2 in F. 

This is a mathematical property that, in fact, is only a necessary condition for 
existence of the required registration device. Even if E existed, the question whether 
it is an observable is non-trivial. 

Proposition 16 Two effects Ei and E 2 are jointly measurable if and only if there 
are three elements E' 1; E' 2 and E' 12 in L^H)^ such that 




(3.3) 



2 



E' 1 + E 2 + E' 12 GL r (H) 



+ 
<i 
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and 

Ei = E 1 + E' 12 , E 2 = E 2 + E 12 . 

For proof, see jH], p. 89. Proposition 16 has two corollaries: 1) two projections are 
jointly measurable if they commute and 2) a sufficient condition for joint measura- 
bility of Ei and E 2 is that 

Ei + E 2 e L r (H)^. 

If an observable is sharp then its effects commute, but the effects of a general 
observable need not commute. Thus, even non commuting operators can be jointly 
measurable. This is completely compatible with standard quantum mechanics. In- 
deed, any POV measure that is an observable of an object S can be measured as 
a sharp observable of a composite system containing S and another object called 
ancilla as subsystems. For proof, see e.g. [56], p. 285. An important example will 
be given in Sec. 4.4. 

Two observables Ei : 1— > L r (H) and E 2 : F 2 1— > L r (H) with F x comprising the 
Borel subsets of K" 1 and F 2 those of M™ 2 are called jointly measurable, if each two 
effects Ei(Xi) and E 2 (X 2 ) are jointly measurable. In this case, there is an effect 
Ei(Xi) A E 2 (X 2 ) in L r (H)< 1 giving the probability that observable E x has value in 
Xi and observable E 2 has value in X 2 . Then, there is a unique observable 

E : F h-> L r (H) , 

where F is the set of Borel subsets of M. ni+n2 and 

E(Xl x X 2 ) = E 1 (X 1 ) A E 2 (X 2 ) . 

We call E compound of Ei and E 2 

An important example are two sharp observables A and B. They are jointly 
measurable, if and only if all their effects (which are projections) commute with 
each other, i.e., the operators commute. In this case, the wedge is just the ordinary 
operator product, 

E A (A 1 )AE B (X 2 ) = E A (X 1 )E B (A 2 ) . 

The compound E AAB of E A and E B is an observable that represents registration of a 
pair of values, one of A, the other of B. 

If two sharp observables A and B do not commute then they can be jointly 
measurable at best with the inaccuracy corresponding to their uncertainty relation. 
This means that only some effects of A are jointly measurable with only some effects 
of B. An example will be given in Sec. 4.4. 
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3.2.2 Contextuality 

The investigations in the field of contextuality were motivated by the following 
problem. In Newton mechanics, a general state of any system S is described by 
probability distributions p : Z h- >■ [0, 1], Z G T on its phase space T. Any such 
distribution results from a fixed preparation and describes the ensemble of individual 
systems S prepared in this way. Each individual system of the ensemble is, however, 
always assumed, at least in Newton mechanics, to be in some state given by a point 
Z of r. We just do not know which point and the distribution p(Z) describes the 
state of our incomplete knowledge. Thus, all observables, which are just functions 
on r, have determinate values for each element of the ensemble. 

Can anything analogous be also assumed for quantum mechanics? Could a state 
operator of a quantum object S describe our knowledge of an ensemble defined 
by the preparation and could the individual elements of the ensemble each have 
determinate values of all observables, which are just not known, or even, be it for 
whatever reason, could not be known? To solve this problem, it is sufficient to 
restrict the observables to sharp observables, because if the question had negative 
answer for a restricted class, it would have negative answer for any class containing 
the restricted one. 

It is technically advantageous to restrict the problem even further by limiting 
oneself to (perpendicular) projection operators to subspaces of H T . Let us denote the 
set of all such projections by P(H T ) C L r (H T ). The problem can then be formulated 
as follows. Is there a dispersion-free probability distribution h : P(H T ) i— > {0,1}? 
Dispersion-free means that its values are just zero and one so that such a distribution 
determines values of all projections. 

The dispersion-free distribution h has to fulfil certain conditions, or else it could 
not be interpreted as concerning properties. Let us first describe the structure of 
P(H T ). 

Each projection a G P(H T ) can be mapped on linear subspace a(H T ) of H r and 
this bijective map allows to define the lattice relations on P(H r ). First, a < b if 
a(H T ) C b(H r ). This defines a partial ordering on P(H T ). Second, a = b 1 - if a(H T ) 
contains all vectors orthogonal to b(H r ). Projection a is called the orthocomplement 
of b. Third, c = a A b if c(H T ) is the set-theoretical intersection of a(H r ) and b(H r ). 
Observe that aAb = ab (operator product of the projections) only if a and b commute 
(are orthogonal). Finally, c = a V b if c(H T ) is the linear hull of a(H T ) and b(H r ). 

P(H T ) with these relations forms the so-called orthocomplemented lattice (for 
proof, see, e.g., Refs. [HIE])' but not a Boolean lattice^. It has however Boolean 

2 In the so-called quantum logic, properties of S are described by elements of P(H r ). They 
represent the mathematical counterpart of the so-called YES-NO registrations [58]. If we pre- 
tend that the values obtained in the YES-NO experiments are properties of a well-defined single 



67 



sublattices, which represent sets of jointly measurable sharp effects, and must, there- 
fore contain only mutually commuting projectors. On these sublattices, h is, in fact, 
an assignment of truth values 0, 1 and it has to satisfy the usual logical conditions 

/i(aVb)=/i(a)V/i(b), h(a A b) = h{a) A h(b) , h^) = h{a) L . 

In these relations, we consider the set {0, 1} as a Boolean lattice of one empty set 
and one arbitrary non-empty set A with _L the set-theoretical complement in A, 
V the set-theoretical union and A the set-theoretical intersection. Thus, on each 
Boolean sublattice, h must also be a Boolean lattice homomorphism. 

The first result relevant to the question of existence of such maps is the Gleason's 
theorem |25^l . It states that the set of all probability distributions on P(H T ) is just 
T(H r ) ] f . Thus, our dispersion-free distributions are to be found in T(H T ) ] f . It is 
easy to show that there are none there. 

One can object that P(H T ) is an idealisation containing an infinity of observables. 
This leads to the question whether there is a finite subset of P(H T ) that does not 
admit such distributions, either. This is the next relevant result, Kochen-Specker 
no-go theorem [40J, in which an example of such a subset is given. There are more 
examples now provided by different physicists (see, e.g., Ref. [H]). The reason why 
h does not exist is that the assignment of a truth value to projector a, say, depends 
on to which Boolean sublattice a belongs. This can be understood as 'context': in 
one case, a is registered jointly with elements of one Boolean sublattice, in another 
case with those of another sublattice. 

The definitive result in this field is Bub-Clifton-Goldstein theorem [15] which 
lists all maximal sub-lattices of P(H T ) that admit dispersion- free probability dis- 
tributions. They need not be Boolean but they are always only proper sublattices 
of P(H T ). Hence, only a limited number of properties can be assumed to be de- 
terminate before registration, and this limits possible 'non-collapse' interpretations 
and modifications of quantum mechanics. In fact, these interpretations or modifica- 
tions can be classified according to the sublattices determined by the Bub- Clifton- 
Goldstein theorem [15j . 

3.3 Superselection rules 

A preparation that deals only with systems of one and the same type prepares so 
called one-type systems. Not every preparation is such. For example, we can ran- 

quantum system, then we are forced to replace the Boolean lattice of ordinary logic by the ortho- 
complemented latice of quantum logic. But this pretence is against all logic because these values 
are not properties of one but of many different systems each consisting of S plus some registration 
apparatus. 

3 Gleason's theorem is valid for Hilbert spaces of dimensions greater than 2. 
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domly mix electrons in some states with protons in some states. Such systems, 
and some generalisations of them, are called n*rf syste m & In this section, the 
mathematical description of mixed systems will be explained. We shall mix just two 
one- type systems, Si and 5 2 , but the generalisation to any number is straightfor- 
ward. 

Let Hx and H 2 be the Hilbert spaces of <Si and Then the Hilbert space H of 
the system S mixing Si and S 2 is 

H = Hi © H 2 . 

The direct sum on the right-hand side is the space of pairs, ("01, V^), 4>i £ Hi and 
i>2 £ H 2 with linear superposition defined by 

a(i>i, ^2) + 4>2) = {aipi + 60i, atp 2 + b(p 2 ) 
and the inner product defined by 

(001, ^2), (01, 02)) = ("01, 0l)l + ("02, 02)2 , 

where (•, -)j is the inner product of Hj. There are then embeddings tj : Hj 1— )■ H 
defined by 

= (Vi,0) , ^2) = (0,Va) 

for any ^ G Hj and i — 1,2. 

Let L(H) be the algebra of bounded linear operators on H and L(Hj) that of Hj. 
If two operators Aj G L(Hj), i = 1,2, are given, then an operator (Ai © A 2 ) G L(H), 
called direct sum of Ai and A 2 , is defined by 

(Ai © A 2 )(^i,^ 2 ) = (Ai(^i),A 2 (^ 2 )) 

for any ("01, ^2 ) £ H. The special property of the direct-sum operators is that the 
subspaces t»(H$) C H are invariant with respect to them. Clearly, not all operators 
in L(H) are of this form. Effects of the form A x © 2 (Oi © A 2 ) can be interpreted 
as representing registrations done on Si (S 2 ) alone. 

The embeddings ti define maps on projections P[ipi], which can be denoted by 
the same symbol, viz. 

ti(P[Vi]) = p[lM] 

for all ipi G Hj. ^ can be extended to the whole spaces T(Hj)^, tj : T(H i ) ] f !->■ 
T(H)+ as follows. Let Tj G T(Hj) + , then 

ti(T 1 ) = (T 1 ,0) , ^ 2 (T 2 ) = (0,T 2 ) . 

4 This has nothing to do with the term 'mixed state', which is sometimes used for non- vector 
states. 
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The convex combinations of states from T(Hi)^ and T(H 2 )i~ are states of mixed 
systems defined at the beginning of this section. However, the states T(H)^ are not 
exhausted by convex combinations of states from T^Hx)^ and T(H 2 )+. 

The structural properties in which the systems Si and 5 2 differ from each other 
can now be viewed as non-trivial operators on H. Let for example the masses fi\ and 
fi2 satisfy yUi 7^ yU 2 . Then the s.a. operator fi±li © A* 2 1 2 defines the mass operator m 
on H with two eigenspaces tj(Hj) and eigenvalues Hi, % = 1,2. Similarly for charges, 
spins, etc. Now, it is easy to see that all operators of L(H) that commute with m 
have the form Ai © A 2 and all operators of this form commute with m. Clearly, s.a. 
operators of the form A x © A 2 are observables of system S. The nature of mixing 
systems suggests the basic rule 

Rule 15 All effects that can be registered on system S mixing S\ and <S 2 have the 
form A 1 © A 2 where A 1 G L^Hi)^ and A 2 G L^Ha)^. 

Hence, the space of all observables of S is not L r (H)< 1 but the subset of all 
observables that commute with m. 
In general, we define 

Definition 19 A discrete sharp observable Z of a system S with Hilbert space H is 
called superselection observable if Z commutes with all observables of S, sharp or 
not sharp. The existence of such an observable is called superselection rule. The 
eigenspaces of Z are called superselection sectors. All superselection observables 
form the centre Z of the algebra L(H). 

A restriction of the set of observables for system S suggests the introduction of 
an equivalence relation on the set of states T(H r )^. 

Definition 20 Two states T 1 and T 2 are equivalent with respect to a set O of 
observables if these states assign the same probability measures to each observable of 
O, that is , pj i = pj 2 for each E e O. In that case, we write T x = T 2 . 

For example, linear superposition 

tt = a(V>i,0) + 6(0,^2) 

for any ^ G Hj, i — 1,2, \a\ 2 + \b\ 2 = 1, and convex combination 

T= |o|\(|^i)(^i|) + |fe|^ 2 (|^2)(^ 2 |) 
are equivalent with respect to the set of superselection observables O, 

|*)<*|=oT. 
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For a given set of observables O, =o is, indeed, an equivalence relation on the 
set of states T(H)^. Let us denote the set of equivalence classes of states in T(H)^ 
with respect to the set of observables O by To(H)^. No two states of the same 
class can be distinguished by any registrations. No two different states from T(H)^ 
are equivalent with respect to all observables unless the set of effects O is restricted. 

Proposition 17 Given a system S with the sets O andTo(H)i of observables and 
state classes, respectively. Then the following statements are equivalent: 

A Z is a supers election observable. 

B Each state is equivalent to a unique convex combination of eigenstates of Z. 
See Ref. [IS], p. 18. 

Rule 16 The states of system S mixing Si and S2 are the equivalence classes with 
respect to the set of observables O of the form Ai © A2. The unique convex combi- 
nation of eigenstates of Z to which each element of T(H)+ is equivalent describes 
the gemenge structure of the class. 

This motivates attempts to utilise superselection rules for solution of the problem of 
classical properties and of quantum measurement [3H El HU [651 [62] . The formalism 
of this section follows Ref. P3] because it gives some meaning to the linear superpo- 
sitions of states from different superselection sectors, which is used by some of the 
references mentioned. 
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Part II 
The models 
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In this part, the basic notions and rules of Part 1 will be employed to construct 
models of observed systems or at least of some aspects thereof. The relation be- 
tween Part 1 and Part 2 within any theory is more or less loose. The general 
language is used to construct models but it does not determine which models are to 
be constructed. Indeed, it could be used also to work with arbitrary or even fancy 
models that need not have much in common with practical experience. The scien- 
tific practice is to encounter various systems in nature and to construct models of 
these observed systems in order to understand and predict their properties. We can 
also improve earlier constructed models in order that they include newly observed 
properties, etc. 

The division of a physical theory into a general part and a model part may 
have some relevance to Popper's philosophy (cf. [59]). The first part is a kind of 
language that does not have any specific, numerical consequences. Only the models 
give specific, numerical values that can be compared with experimental results and 
observations. It seems therefore that the general part is practically not falsifiable but 
a specific model surely is. Thus, a failure of a theory can always be interpreted as a 
failure of some models. Two different reactions to such a falsification are possible. 
Either one constructs a completely new theory starting with the general part, or 
one tries to improve the models. For example, the difficulties of Newton mechanics 
in the domain of quantum phenomena can either lead to the invention of quantum 
mechanics or to the improvement of the corresponding Newton-mechanical models. 
Indeed, the Bohm-de Broglie pilot-wave theory can be viewed as an example of the 
latter approach. 

The menagerie of successful models is a very important part of any physical the- 
ory. Textbooks of quantum mechanics dedicate most of their space to an account 
of it. We shall restrict ourself only to those models that are immediately impor- 
tant to our main aim: to solve the problem of classical properties and quantum 
measurement. 
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Chapter 4 

Models of microscopic systems 



In this chapter, we limit ourselves just to few examples of microsystems, first to show 
the crucial steps of model construction within our conception of quantum mechanics 
and second to introduce some models that will be important later. 

The construction of quantum models can be divided into two steps. First, as- 
sumptions are made about states and observables of particles. Second, the states 
and observables of a composite quantum system can be constructed as described in 
Sees. 2.1 and 2.2 in agreement with its structural properties such as composition 
and a specific Hamiltonian. 

The operators that will be defined in the present section will have some relations 
to similar quantities that describe classical systems of an analogous composition 
and they will have names that do not seem to be justified by anything but this 
classical analogy. This is true but it will be interpreted in this paper as follows. 
The definition of the operators is a purely quantum-mechanical business and the 
role of the corresponding observables as well as their registration method belong 
to important new assumptions of the theory. The only concession we can do is 
that the classical mechanics can give us a hint on how quantum operators may be 
constructed, but the definitive decision is only possible if the model works, that is, 
predicts properly what can be observed. 

One often tries to find some way from classical systems to analogous quantum 
systems, a so-called 'quantisation' method, or at least to formulate some kind of 
'correspondence principle' concerning relations between classical and quantum me- 
chanics. In our opinion, the natural way is just the opposite and must lead from 
quantum mechanics to the classical one because classical systems are just some 
macroscopic quantum systems. This ought to explain any correspondence. 

Model assumptions are assumptions about specific systems that are to be mod- 
elled. They will, therefore, be called 'Assumptions' to distinguish them from general 
'Rules'. 
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4.1 Basic PV measures 



Let us start with the following structural properties: a particle with mass fi, spin s 
and charge e (this determines the type r). To construct the Hilbert space H MjSje of 
such model, we shall work in Q-representation. Consider the set of rapidly decreasing 
C°° functions 4>(x,m) (wave functions), where m e {— s, ■ ■ ■ , s}. Then, H M s e is the 
completion of the set of the wave functions with respect to the inner product 

{(/>\ip) = 2, / d 3 xcj)* (x, m)ip(x, m) . 

The states of the particle that can be prepared are elements of T(H At s e )^. 

Next, we define certain operators on H^^: position q = (q^q^q 3 ), (linear) 
momentum p = (p 1 , p 2 , p 3 ) and spin s = (s 1 , s 2 , s 3 ), where the symbol q denotes the 
compound observable (see Sec. 3.2.1) q 1 A q 2 A q 3 and similarly for other vectorial 
operators. 

The position and momentum operators have already been defined in Sec. 1.3, Eqs. 
(11.2ip and fll.22p . The three components of spin are (2s + 1) x (2s + 1) Hermitean 
matrices s^ n that act on wave functions (vector-valued functions) as follows 

s 

s k (f)(x,m) = * k mn^,n) . (4.1) 

n=—s 

In non relativistic quantum mechanics, one mostly works only with s = or s = 1/2. 
The matrices for s = 1/2 are 

'-<o-0- ^ 

All components of q, p and s can be extended to self-adjoint (s.a.) operators on 

The common invariant domain of operators q, p and s is the set of rapidly de- 
creasing C°° functions 4>(x, m). Hence one can construct other operators from them 
by products and linear combinations, e.g., orbital angular momentum 

I 1 „2„3 „3„2 I 2 _3„1 „1„3 i 3 „1„2 „2„1 

L=qp-qp, L=qp-qp, L=qp-qp. 

The operator has a unique s.a. extension. One of the assumptions of quantum 
mechanics is that one can construct all observables for a particle from its basic 
operators. This is the reason why they are called basic. The construction is simple, 
if the resulting observable is a polynomial. More complicated functions can be 
constructed using spectral theorem. For example, any function V(q) can be defined 
by 

V(q) = [ V(i\L 2 ,t 3 )dE^ . 

JR3 



75 



-t -* 3 

Let us construct the PV measures E q , E p and E s . The dimension of the first two 
is 3 and F is the set of all Borel subsets of M 3 . We define 



E q (X)ip( y x,m) = x[X](x)^(x,m) , 



(4.3) 



where x[X] is the characteristic function of X G F, that is, x[X](x) = 1 for x E X 
and x[X](x) = otherwise. Clearly, the value set of E q is the whole of R 3 . 
For E p , we define 



where = 1 for p G X and x[X](p) = otherwise. The value set of E p is the 

whole of M 3 . 

The three components of spin cannot be combined to a single PV measure because 
they do not commute. Let us describe the third component s 3 . PV measure E s3 is 
one-dimensional and F is the set of Borel subsets of R. We define 



where x[X](m) = 1 if m E X C\ {— s, ■ ■ ■ ,s} and x[X](m) = otherwise. E s is a 
discrete PV measures with value space {— s, ■ ■ ■ ,s}. 
Let us summarize: 

Assumption 1 Any particle with mass fi, spin s and charge e is a quantum system 
with Hilbert space H^g. Its basic operators are position, momentum and spin, 
defined by Eqs. M.21\) , M.22\) , as well as &4-l\) and h4-2\ ) for particles with spin 1/2. 

According to Sec. 2.2.2, these basic PV measures are not observables. To obtain 
observables from them, separation status D in which we are going to prepare all 
particles is to be determined and D-localisation of the PV measures have to be 
constructed. We shall speak about measurement of basic observables understanding 
the so constructed POV measures. 

Suppose that a free particle is prepared in D. Registration of the (smeared) 
position of the particle can e.g. be carried out by a small Geiger counter that is 
placed so that its sensitive volume V is at a fixed position in D. If it counts at time t, 
then we know that the registered position of the quantum system were in V at t. The 
momentum of the particle can be inferred from its energy because its Hamiltonian 
is a quadratic function of the momentum (see the next subsection), and from the 
direction of motion that is given, e.g., by apertures in screens. This functional 




3 

E s (X)tfj(x,m) = x[X](m)il>(x, m) , 



(4.5) 
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relation between momentum and Hamiltonian is of course only approximate for D- 
localised quantities. The spin can be registered via the Stern-Gerlach apparatus 
because even neutral particles have a non-zero gyromagnetic ratio. 

Finally, the basic operators of a composite system can be obtained from the basic 
operators of its subsystems. A prominent example in any textbook of quantum 
mechanics is the hydrogen atom. A possible model of it is composed of an electron 
and a proton. The Hilbert space of hydrogen is H e ® H p , where H e and H p are 
those of the electron and the proton. These are just H MiSje with suitable numerical 
values of /i, s and e. The basic operators are q e <g> l p , p e ® l p , Se <8> l p , l e <8> qp, l e <S> P P , 
l e <g> s p . 

Further operators can be constructed by algebraic operations from the basic ones, 
and the corresponding observables then by D-localisation. 



4.2 Hamiltonian 

Assumption 2 Hamiltonian of any particle with mass fi, charge e and gyromag- 
netic ratio g moving in classical electrostatic potential V(x) and classical stationary 
magnetic field B{x) is 

H = ^- + eV(q)-gs-B(q) , 

where the square means the squared norm of a 3-vector and '■ ' is the inner product 
of 3-vectors. 

A textbook model of hydrogen atom consists of a spin-zero proton of mass fi p and 
charge e and a spin-zero electron of mass /i e and charge —e. The Hamiltonian of 
the model is 



2 fx e 2 n P \q e — qp| 

For any quantum system S with Hamiltonian H, the spectrum of the s.a. operator 
H is called energy spectrum of S. The energy spectrum is thus a structural property. 
Systems can be recognised via their spectra. If light comes from far away in the 
space and the (possibly Doppler shifted) Balmer series is found in its spectrum, 
hydrogen must be somewhere there. 

Kinetic energy of a free system can e.g. be measured by a proportional counter. 
In fact, what is measured in this way is a D-localisation of the total kinetic energy, 
which is a s.a. operator that forms a part of the Hamiltonian. For a particle, it is 

2/i 

and the spectrum is [0, oo). For the hydrogen atom, we had first to separate the 
motion of its centre of mass from its internal degrees of freedom. An utterly different 
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business is a measurement of a spectrum of a bound system. Spectra of bound 
systems are discrete and can be measured if large number of the same systems 
emits or absorbs electromagnetic radiation. This is an indirect measurement via 
scattering of photons in which the final 'kinetic' energy of photons is measured by 
a spectrometer. 

4.3 Galilean group 

A free particle is defined by V(q) = const and B(q) = 0. It is an isolated quantum 
system S and Galilean group has, therefore, a unitary ray representation on H ftS . e , 
see Sec. 1.3. We can describe it by writing up its generators. From Sec. 1.3, we 
know already the form of space translations, space rotations and time translations 
even for systems that are not isolated. The additional assumption is 

Assumption 3 For a particle (not necessarily free) with mass \i and spin s, the 
total angular momentum is J fc = L k + s k . For a free particle, the boosts are generated 
by b k = -/J>q h . 

All operators are written in the Schrodinger picture. The only group representatives 
where multipliers appear is multiplication of boosts, U(V), with space shifts U(a). 
This leads to 'anomalous' commutator, 

[b fe , pi] = -ihfidki. 

It is easy to work out how the corresponding one-parameter groups act on wave 
function ip(x, m): 

U(a)ip(x, m) = ip(x — a, m) 

and 

\J(v)ip(x, m) = exp ^— nv ■ x^j ip(x,m) . 

Thus, the boost just shifts the momentum by fiv. For example, the average of 
momentum in the shifted state equals to that in the original one plus fiv. The 
operators b k and generate a subgroup of G called Heisenberg group. 

4.4 Joint measurement of position 
and momentum 

Existence of an observable that represents a joint measurement of position and mo- 
mentum will play some role in Chap. 5. To this aim, we generalise the construction 
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of such an observable for a simplified model that was first proposed in Ref. [T]. The 
model system S is a free one- dimensional spin-zero particle with position q and mo- 
mentum p. The Hilbert space is L 2 (R) and the operators are defined by equations 
analogous to fTOll and ffl~22l) . 

Operators q and p have an invariant common domain and their commutator is 
easily calculated to be 

[q, p] = ih . 

Hence, the joint measurement may be a problem. 

The general construction of a non-trivial POV measure for system S introduces 
another system, ancilla, that forms a composite system with S. Let our ancilla A be 
a similar particle with position Q and momentum P. We work in Q-representation 
so that the Hilbert space of the composite system S + A is L 2 (IR) <g> L 2 (R) which 
can be identified with L 2 (R 2 ). Then, we have wave functions ty(q,Q) and integral 
operators with kernels of the form 0(q, Q; q', Q'). 

The dynamical variables A = q — Q and B = p + P of the composite system S + A 
commute and can therefore be measured jointly. The value space of PV observable 
E AAB is R 2 with coordinates a and b (see end of Sec. 3.2.1). 

The next step is to smear E AAB to obtain a realistic POV meausre E k . Let us divide 
the ab plane into disjoint rectangular cells X k = [a k -,a k+ ] x [bk-,bk+\ covering the 
entire plane. Each cell is centred at (a^, bk), a k = (ak+ + «fc-)/2, bk = (bk+ + fefc_)/2 
and Sk = (dk+ — ctfc_)(&fc+ ~ is its area. Then, 

E k = E AAB (X fc ) = E A (K_,a fc+ ])E B ([6 fc _A + ]) . (4.6) 

The cells can be arbitrarily small. 

The probability to obtain the outcome k in state T of the composite system is 

tr[E k J] = J dqdQdq'dQ'E k (q,Q;q',Q')T(q',Q';q,Q) . (4.7) 

We assume that the composite system S + A is prepared in a factorised state 

T = T 5 <g) T_4 (4.8) 

and express the probability (14. 7p in terms of the state T5. The action of the pro- 
jections E A ([afc„, a k +\) and E B ([6 fc _, b k+ }) on vector states of the form ^/(q,Q) is 

E A ([a fe _, a fc+ ])*(g, Q) = x[a fc _, a k+ }(q - Q)V(q, Q) , 

where 

x[a k -,a k+ }(x) = 1 Vx e [a k ^,a k+ ] , 
X[a k -,a k+ }(x) = Vxnot G [a k _,a k+ ] 
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and 



E B ([6 fc _A + ])*(g,Q) = (2nh)- 2 J dpdPdq'dQ' 



where 



exp l -\p(q - q') + P(Q - Q')} X [bk-, &*+](? + Q' 



X[bk~,b k+ }(x) = 1 Vx £ \bk-i b k +\ , 
x[b k -,b k+ ](x) = Va; not e [b k -,b k +] 



The trace f)4.7p can be calculated in several steps as follows. First, 

ir[E fc T] = ir[E A ([a fc _, a fc+ ])E B ([6 fe _, b k+ ])T] = tr[E B ([b k ^ b k+ })TE A ([a k _, a k+ ])] 
= (2nhy 2 J dqdQ dpdPdq'dQ' exp ^\p(q - q) + P(Q - Q')] 

X[h-, bk+](p + P)x[a>k-, a k+](q ~ Q)T(q', Q'\ q, Q) 
Second, introduce new integration variables q, a, p, b, q', a', 



tr[E k T] = (2nh) 2 J dq da dp db dq' da' 



exp-\p(q-q')+(b-p)(q-a-q'+a')]x[b k -,b k+ ](b)x[a k -, a k+ ](a) T(q', q'-a'; q, q-a) . 

Third, if the cells are sufficiently small, the integrands do not change appreciably 
inside the integration intervals of a and b so that they can be approximated as 
follows: 

dadbx[ak-,ak+}(a)x[b k -,bk+}(b)f(a,b) « S k f(a k ,b k ) 



In this way, we obtain 
tr[E fc T] « ^ 2 ^ 2 J dqdpdq'da' 



1 

exp -[{p{a k - a') + b k (q - q' - a k + a')} T(q', q' - a'; q,q- a k ) 

h 



But, fourth, the factor containing the integral over p is a 5-function, 

J dp exp jV{ a k — «') = 2irh5(a' — a k ) . 



Thus, we obtain, 



S f % 
tr[E k T) « , 2 jL 2 J dqdq' exp -b k (q - q') T(q', q' - a k ; q,q- a k ) . 
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Fifth, we use Eq. f l4~8j) . 

T(q',Q';q,Q) = T S (q',q)T A (Q',Q) , 

with the result 

Proposition 18 The probability of the outcome k to be found in the factorised state 
fi4-8^ is given approximately by 



tr[E k T] fa tr 

where 



Ts j2^ TAakA 



(4.9) 



1 

T4K, b k ] = exp -b k (q - q') T(q', q' -a k ;q,q- a k ) 

is the state first shifted by a k and then boosted by —b k . The approximation 
improves if the cells are smaller. 

If the cells are sufficiently small, we have q ~ Q + a k and p ~ b k — P- In this way, 
the coordinate of S is shifted by a k while the inverted momentum is shifted by —b k 
with respect to those of A. 

Eq. (I4.9p shows that there is an 'effective' POV measure E$ k for system S defined 

by 

Esk = j2^ JA[akM 
that yields the probability of the outcome k of the above registration considered as 
a registration performed on S. 

The state is completely arbitrary. To construct a useful quantity, one usually 
chooses a vector state in the form of a Gaussian wave packet (see, e.g., Ref. P- 
418), 

m a = (7nx 2 r 1/4 exp 



Q 2 



2a< 



Easy calculation yields 



(Q) = (P> = 0, AQ = 4=, AP h 



y/2 ' aV2 

so that is a state of minimum uncertainty. If we shift \& a by a k and then boost 
it by — b k , we obtain 

MUa fc A] = (^ 2 )- 1/4 exp ^ ~ {hQ^j , 

which is the Gaussian packet concentrated at (a k , —b k ), 

(Q) = a k , (P) = -b k , (4.10) 
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and 

As the Gaussian wave packet is uniquely determined by its averages and variances, 
we can interpret the observable as giving the probability that the corresponding 
registration applied to state T5 detects the state of S with the averages and variances 
given by Eqs. ( HTTP]) and f ETTTj) . 
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Chapter 5 

Quantum models of classical 
properties 

Quantum mechanics is a genuinely statistical theory. A well-defined repeatable 
preparation procedure defines an ensemble. Values of any observable obtained by 
repeated registrations on individual elements of the ensemble satisfy statistical laws 
that are given by quantum mechanics. Moreover, in general, values of observables 
are not objective, i.e., the individual elements of the ensemble cannot be assumed 
to possess determinate values of their observables before or independently of regis- 
trations and the results of individual registrations cannot be predicted. 

On the other hand, basic physical theories that describe classical world, i.e. me- 
chanics, electrodynamics, theory of gravitation, are objective and deterministic in 
character. This comprises three assumptions. First, the states are determined by 
values of observables and can be absolutely sharp (they contain no random elements) 
and, second, the sharp states are objective in the sense that any system always really 
is in one of them. Fuzzy states are also possible. They have the form of probability 
distributions of sharp states and are due to incomplete knowledge. Third, dynamical 
laws determine a unique evolution of all states. 

More generally (i.e., even for thermodynamics), there is a criterion of objectivity, 
the so-called Modified Principle of Macroscopic Realism^ (MPMR) [29] : 

1. A macroscopic system which has available to it two or more distinct classical 
states is at any given time in a definite one of those states. 

1 The original Principle of Macroscopic Realism has been formulated by Leggett [44]. We have 
replaced Leggett's 'macroscopically distinct (quantum) states' by 'distinct classical states'. Indeed, 
if the 'macroscopically distinct (quantum) states' include pure quantum states, point 1 of Leggett's 
macroscopic realism violates the principle of superposition. Then, one has to assume that some as 
yet unknown phenomena exist at the macroscopic level which are not compatible with standard 
quantum mechanics (see, e.g., |44j and the references therein). However, no such phenomena have 
been observed. 
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2. It is possible in principle to determine which of these states the system is in 
without any effect on the state itself or on the subsequent system dynamics. 

To obtain classical theories as some limiting cases of quantum mechanics or to 
construct quantum models of classical properties is a difficult problem. As yet, only 
partial and incomplete solutions to this problem exist. Let us now briefly review 
the most popular approaches to the problem. The shortcomings of the approaches 
are well known [22] [5] [70]. We mention them only fleet ingly. First, the quantum 
decoherence theory [76l 175] works only if certain observables concerning both the 
environment and the quantum system cannot be measured (see the analysis in [22] 
ITS]). The deep reason is that one works with values of observables. Second, the 
theories based on coarse-grained operators [56] [60], HTJ: the problem is the same as 
with the decoherence. For example, the Leggett-Garg inequality [JT] is a condition 
for the validity of the principle of macroscopic realism that works with values of 
observables. Third, the Coleman-Hepp theory [31] [6] [11] and its modifications [65] 
[62]: they are based on some particular theorems that hold for infinite systems but 
do not hold even approximately for finite ones (see the analysis in [6]). 

If we turn from theory to experiment, we may notice that any well-founded scien- 
tific observation of classical properties always has a statistical form. A measurement 
or observation is only viewed as well understood, if it is given as an average with a 
variance. This fact does not by itself contradict the deterministic character of the 
corresponding theory. The usual excuse is that the observation methods are beset 
with inaccuracy but that improvement of techniques can lead to better and better 
results approaching the 'objective' values arbitrarily closely. 

This popular excuse is clearly incompatible with the assumption that the classical 
world is only an aspect of a deeper quantum world and that each classical model 
is nothing but a kind of incomplete description of the underlying quantum system. 
If we assume such universality of quantum theory, then the statistical character of 
classical observational results must not only be due to inaccuracy of observational 
methods but also to genuine uncertainty of quantum origin. This point of view is 
originally due to Exner [2TJ, p. 669, and Born [T2] and will be adopted here as a 
starting point of our theory of classical properties. 

5.1 Modified correspondence principle 

Born- Exner assumption has quite radical consequences that are only seldom realised. 
First, the exactly sharp states and trajectories of classical theories are not objective. 
They do not exist in reality but are only idealisations. What does really exist are 
fuzzy states and trajectories. The objectivity of fuzzy states in classical world is a 
difficult point to accept and understand. Let us explain it in more detail. 
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In quantum mechanics, the basis of objectivity of dynamical properties is the 
objectivity of the conditions that define preparation procedures. In other words, if 
a property is uniquely determined by a preparation, then it is an objective property. 
If we look closely, the only hindrance to try the same idea in classical theories is just 
the custom always to speak about initial data instead of preparations. An initial 
datum is nothing but a state, more or less sharp one. The question on how an 
exactly sharp state can come into being is ignored. This in turn seems justified by 
the hypothesis that sharp states are objective, that is, they just exist by themselves. 

Thus, one has to accept that preparation procedures play the same basic role 
in the classical as in quantum physics. Then, the nature and form of necessary 
preparation procedures must be specified and the corresponding states described. 

Our starting point was that all physical systems are quantum systems. Let us now 
make it more precise. There is one level of description (approximative model of some 
aspects of a physical system), which determines a classical system S c and its classical 
properties for which quantum theory is not needed, namely the classical description. 
In addition, the same system can also be understood as a quantum system S q such 
that the classical properties of S c are some objective quantum properties of S q . This 
follows from the fact that all classical properties can be assumed to be objective 
without any danger of contradictions. S q is a quantum model of S c . 

The construction of quantum model S q consists of the following points. 1) The 
composition of S q must be defined. 2) The observables that can be measured on 
S q arc to be determined. On the one hand, this is a non-trivial problem: there 
are relatively strong restrictions on what observables of macroscopic systems can be 
measured (Sees. 2.2.2 and 6.2). On the other, as any observable is measurable only 
by a classical apparatus, the existence of such apparatuses is tacitly assumed from 
the very beginning. Quantum model S q will thus always depend on some classical 
elements. This does not mean that classicality has been smuggled in because, in our 
approach, classical properties are specific quantum ones. 3) A Hamiltonian operator 
of the system must be set up. Finally, the known classical properties of S c must 
be listed and each derived as an objective property of S q from the three sets of 
assumptions above. This is a self-consistent framework for a non-trivial problem. 

It follows that there must be some at least approximate relation between classical 
observables of S c and quantum observables of S q as well as between the classical 
states of S c and quantum states of S q . The following model assumption on such a 
relation might be viewed as our version of Correspondence Principle, let us call it 
Modified Correspondence Principle (MCP). 

Assumption 4 First, the state of classical system S c in a given classical theory is 
always described by a set ofn numbers {ai, • • • , a n } (state coordinates) that represent 
values of some classical observables. Let us assume that there are preparation proce- 
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dures that make the fuzziness of real classical states negligible. This can be described 
by considering the values {ai, • • • , a n } as averages with variances {Aai, ■ • • , Aa n } 
that are relatively small, 

aj fcl r ; J < 1 . 5.1 

5W/i states are the best we can achieve on the way to sharp classical states. 

Second, we assume that there is a subset {ax, • • • , a n } of sharp observables of 
quantum system S q and a state T such that 

tr[Ta fc ] = a k , ^tr[Ja 2 k ) - (tr[Ta fc ]) 2 = Aa k . (5.2) 

All such states can form a large subset of T(HsJ|" . Each of them can be considered 
as a quantum counterpart of the original classical state. 

Third, there are fuzzy classical states with a probability distribution p(a±, • • • , a n ) 
that either need not have negligible variances or need not be determined by its aver- 
ages and variances, or both. However, we assume that every classical state of this 
kind results from a preparation procedure such that the resulting state T that can be 
interpreted as a gemenge of some states with small variances and that p(oi, • • • , a n ) 
can be well approximated by the coefficients of such a gemenge. 

Let us give an example. The classical state of a mass point in mechanics can 
be described by three coordinates q k = a k and three momenta p k = a^+k, and the 
operator algebra of its quantum model contains the operators q k = a fc and p k = a 3+fe . 

Of course, we can define a sensible classical state only for macroscopic systems so 
that their classical states have a huge number of quantum counterparts and in this 
way contain much less information than their quantum states. Each of the many 
quantum states satisfying the above equations can be viewed as representing one 
and the same classical state. 

Clearly, MCP does not need the assumption that all observables from {ai, • • • , a n } 
commute with each other. Then, even if they themselves are not jointly measurable, 
their fuzzy values can be (see Sec. 4.4). 

Moreover, it does not follow that each classical property is an average of a quan- 
tum operator. That would be false. We assume only that the classical state coordi- 
nates ai, a n can be chosen in such a way. 

It is important to realise that MCP is just compatible with but does not imply 
MPMR. It suggests the way of how quantum models of classical properties can be 
constructed. In each such model, validity of MPMR ought to be achieved indepen- 
dently. 

We emphasize that our approach to classical properties is very different from 
the well-known WKB approximation method. In particular, pure states such as 
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coherent ones that have averages with small variances do not satisfy MPMR. Not 
only are pure states readily linearly superposed but also any quantum registration 
that were to find the parameters of a coherent state (a generalized measurement: 
positive operator valued measure) would strongly change the state (for a general 
argument, see Ref. [16J, p. 32). 

An interesting subset of classical properties of macroscopic system are the ther- 
modynamic ones. They are important for us because quantum models of these 
properties are available. Moreover, quantum models of this kind of classical proper- 
ties are compatible with Copenhagen interpretation [19]. Existing models based on 
statistical physics need one non-trivial assumption: the states of sufficiently small 
macroscopic systems that we observe around us are approximately states of maxi- 
mum entropy. As it has been discussed in Sec. 3.1, entropy is an objective property 
of quantum systems because it is defined by their preparation. Thus, the validity 
of thermodynamics depends on the preparation conditions, or the origin, of ob- 
served macroscopic systems. The averages that result from the models based on 
the maximum-entropy assumption agree with observations and have small variances 
spontaneously. In this way, they comply with MCP. We describe a simplified model 
in the next section. 

The physical foundations of thermodynamics are not yet well-understood but 
there are many ideas around. They might follow partially from logic (Bayesian 
approach, [36]) and partially from quantum mechanics (thermodynamic limit, [69], 
Vol. 4). Some very interesting models of how maximum entropy quantum states 
come into being are based on entanglement [231 EU EH 126]). 

In any case, statistical methods can be generalised as follows [30] • We assume 
that the maximum entropy principle underlies all classical properties: 

Assumption 5 An overwhelming part of macroscopic systems occur in quantum 
states that approximately maximize entropy under the condition of given averages of 
some quantum observables. 

Let us briefly compare our approach with the theories mentioned at the beginning 
of this section. It starts with the idea that all classical properties of a macroscopic 
system S in a quantum state T are certain objective quantum properties of S in 
T. Then, first, objective properties are quantum properties of all quantum systems 
and there is no question about how they emerge in quantum mechanics. This can 
avoid e.g. the artificial construction in the Coleman-Hepp approach. The new point 
is that they are considered as, and proved to be, objective [29]. Hence, second, 
they could in principle serve as classical properties because they can satisfy MPMR. 
This avoids the problems of both the quant um-decoherence and the coarse-grained 
theory that assume values of quantum observables to be objective. Such an as- 
sumption, as analysed in [15], works only for restricted classes of observables, all 
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other measurements being forbidden without an independent justification. Third, 
we conjecture that certain macroscopic quantum systems possess objective proper- 
ties, either structural or dynamical, that can fully model their classical behaviour. 
Hence, classical states and properties defined in the present paper are available only 
for some quantum systems and the relation between classical and quantum states 
is not one-to-one but one-to-many. This is different from other approaches such as 
Wigner-Weyl-Moyal scheme, quantum-mechanics-on-phase-space theory or coherent 
state approach. Finally, our construction of classical properties uses the way analo- 
gous to that of statistical physics. Thus, our MCP is a basis of modelling classical 
properties of quantum systems. 

5.2 A model of thermodynamic properties 

Here, we shall construct the length of a body as an average value with a small 
variance. No original calculation is to be expected, but simple and well known 
ideas are carefully interpreted. This entails that, first, the quantum structure of 
the system must be defined, second, the structural properties such as the energy 
spectrum calculated, third, some assumptions on the state of the system done and 
fourth, some dynamical properties derived that satisfy our definition of classical 
property. 

5.2.1 Composition, Hamiltonian and spectrum 

We shall consider a linear chain of N identical particles of mass \x distributed along 
the x-axis with the quantum Hamiltonian 



involving only nearest-neighbour elastic forces. Here operator x n is the position, 
operator p n the momentum of the n-th particle, k the oscillator strength and £ 
the equilibrium interparticle distance. The parameters /x, k and £ are structural 
properties (the last two defining the potential function). 

This kind of chain seems to be different from most that are studied in litera- 
ture: the positions of the chain particles are dynamical variables so that the chain 
can move as a whole. However, the chain can still be solved by methods that are 
described in [2SJ El] • 

First, we find the variables u n and q n that diagonalize the Hamiltonian and define 




(5.3) 
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normal modes. The transformation is 

N-l 



m=0 ^ ' 



(5.4) 



and 



iV-1 



where the mode index m runs through 0, 1 
matrix; for even m, 



Pn = Yl Om, (5-5) 

, N — 1 and Y™ is an orthogonal 



m=0 



7rm 



F n m = A(m) cos 

while for odd m, 

Y™ = A{m) sin 
and the normalization factors are given by 

1 



n — 



~N~ 



N + 1 
2 

JV + 1 



n — 



(5.6) 



(5.7) 



A(0) 



(5. 



To show that u n and q n do represent normal modes, we substitute Eqs. (15. 4p and 
(15. 5p into (15.31) and obtain, after some calculation, 



N-l 



N-l 



LI 1 \ ^ 2 , Z 1 \ ^ 2 2 



m=0 m=0 

which is indeed diagonal. The mode frequencies are 

2k inn 
u m = sin — — . 
yii N2 



(5.9) 



Consider the terms with m = 0. We have coq = 0, and = 1/yN. Hence 



u 



N 1 " 1 



so that 



n=l 



U() 



n=l 



iVX, q 







AT 



where X is the centre-of-mass coordinate of the chain and P is its total momentum. 
The 'zero' terms in the Hamiltonian then reduce to 

2M 
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with M = NjjL being the total mass. Thus, the 'zero mode' describes a straight, 
uniform motion of the chain as a whole. The other modes are harmonic oscillators 
called 'phonons' with eigenfrequencies u m , m = 1, 2, . . . , N—l. The energy spectrum 
of our system is built from the mode frequencies by the formula 

N-l 

E = ^2v m hw m , (5.10) 

m=l 

where {z/ m } is an (N — l)-tuple of non-negative integers — phonon occupation num- 
bers. 

At this stage, a new and important assumption must be done. We imagine that 
all states T of the modes m — 1, • • • , TV — 1 are prepared that have the same average 
internal energy E, 



Tr 



p2 

T[H- 



E. 



2M / 

We further assume that it is done in a perfectly random way, i.e., all other conditions 
or bias are to be excluded. Hence, the resulting state must maximize the entropy. 
In this way, the maximum of entropy does not represent an additional condition but 
rather the absence of any, see Sec. 3.1. The resulting state Te is the Gibbs state of 
the internal degrees of freedom. The conditions that define the preparation of Gibbs 
state are objective and need not be connected with human laboratory activity. 

The internal energy has itself a very small relative variance in the Gibbs state; 
this need not be assumed from the start. Thus, it is a classical property. All other 
classical internal properties will turn out to be functions of the classical internal 
energy. Hence, for the internal degrees of freedom, E forms itself a complete set of 
state coordinates introduced in Assumption 4. 

The mathematics associated with the maximum entropy principle is variational 
calculus. The condition of fixed average energy is included with the help of Lagrange 
multiplier denoted by A. It becomes a function X(E) for the resulting state. As it 
is well known, X(E) has to do with temperature. 

The phonons of one species are excitation levels of a harmonic oscillator, so we 
have 

— (a m + a m ), 

where a m is the annihilation operator for the m-th species. The diagonal matrix 
elements between the energy eigenstates | z/ m ) that we shall need then are 

(u m \u m \u m ) = 0, (v m \u 2 m \v m ) = (2is m + 1). (5.11) 

ZflU m 

For our system, the phonons of each species form statistically independent sub- 
systems, hence the average of an operator concerning only one species in the Gibbs 
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state Tg of the total system equals the average in the Gibbs state for the one species. 
Such a Gibbs state operator for the m-th species has the form 



m) (V I 



u m =0 

where 

P ( Z ] = Z m eX P (-^Aw m Z/„ 

and Z m is the partition function for the m-th species 



Z m (A) = J2 ^ mVm = x _l-^ m ( 5 - 12 ) 

u m =0 

5.2.2 The length of the body 

The classical property that will be defined and calculated in our quantum model is 
the average length of the body. Let us define the length operator by 

L = xn — Xi. (5.13) 

It can be expressed in terms of modes u m using Eq. ( 15. 41) . 

N-l 



m=0 



The differences on the right-hand side are non-zero only for odd values of m, and 
equal then to — 2Y™. We easily find, using Eqs. (15. 7p and (15. 8ft : 

m=l ^ ' 

The average length is obtained using Eq. (15. lip . 

(L) E =(N-1)£. (5.15) 

It is a function of objective properties N, £ and E. 

Eq. (I5.14p is an important result. It shows that contributions to the length are 
more or less evenly distributed over all odd modes. Such a distribution leads to a 
very small variance of L in Gibbs states. A lengthy calculation |29j gives for large 
N 

(\-)e ttk£V\VN 
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Thus, the small relative variance for large N need not be assumed from the start. 
The only assumptions are values of some structural properties and that an average 
value of energy is fixed. In the sense explained in Section 5.1, the length is then a 
classical property of our model body. We have obtained even more information, viz. 
the internal-energy dependence of the length (in this model, the dependence is triv- 
ial). This is an objective relation that can be in principle tested by measurements. 

Similar results can be obtained for further thermodynamic properties such as 
elasticity coefficient, specific heat etc. They all are well known to have small vari- 
ances in Gibbs states. The reason is that the contributions to these quantities are 
evenly distributed over the normal modes and the modes are mechanically and sta- 
tistically independent. Further important classical properties are the mechanical 
ones: centre of mass and total momentum. In fact, these quantities can be chosen 
as the remaining state coordinates for the whole chain. The contributions to them 
are evenly distributed over all atoms, not modes: the bulk motion is mechanically 
and statistically independent of all other modes and so its variances will not be small 
in Gibbs states. Still, generalized statistical methods can be applied to it. This is 
done in the next section. 

The last remark is that the thermodynamic equilibrium can settle down starting 
from an arbitrary state only if some weak but non-zero interaction exists between 
the phonons. We assume that this can be arranged so that the influence of the 
interaction on our result is negligible. 



5.3 Maximum entropy assumption 
in classical mechanics 

Let us start with the warning that the subject of this section has nothing to do with 
what is usually called 'statistical mechanics'. 

If one is going to model classical mechanics then what are the properties that one 
would like to reproduce? The most conspicuous property from the point of view of 
quantum mechanics appears to be the sharpness of mechanical trajectories in the 
phase space because quantum mechanics denies the existence of such trajectories. 
This leads most researchers to aim at quantum states the phase-space picture of 
which is as sharp as possible. That are states with minimum uncertainty allowed 
by quantum mechanics. For one degree of freedom, described by coordinate q and 
momentum p, the uncertainty is given by the quantity 

2AqAp 

U =— fi~> (5 - 17) 
where Aa denotes the variance of quantity a, as defined by Eq. (12. 3ft . 
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The states with v — 1 are, however, very special states. First, they must be pure 
states such as Gaussian wave packets or coherent states. Such states are very difficult 
to prepare unlike the usual states of macroscopic systems described by classical 
mechanics. They are also prone to strong distortion by measurements. Moreover, 
as pure states, they can be linearly superposed. This is another peculiarity that is 
never observed for states of systems of classical mechanics. Hence, trying to get a 
trajectory as sharp as possible entails the loss of other desirable properties. 

Moreover, the systems to which classical mechanics is applicable, are macroscopic. 
The overwhelming number of their degrees of freedom decouple from their mechan- 
ical ones, of which there are always only very few. The purely mechanical degrees 
of freedom have to do, e.g., with the motion of the centre of mass of a small but 
macroscopic body. As explained at the beginning of this chapter, the sharpness 
of phase-space trajectories is just an idealisation, a limit in which things become 
mathematically simpler. We can use it in calculations which, however, must also 
take into account the necessary non-zero variances of real observations. Indeed, 
such observations are generally afflicted with uncertainties v >> 1. Hence, if we 
want to compare the predictions of quantum models with observations of classical 
mechanics, we are forced to compare states that are fuzzy in both theories. 

Our main idea is to consider states with given averages and variances of coor- 
dinates and momenta, Q k , AQ k , P k , AP k , and leave everything else as fuzzy as 
possible. Thus, the averages of coordinates, of their squares, of momenta, and of 
their squares are fixed. To calculate the corresponding probability distributions in 
classical, and the state operators in quantum mechanics, we shall, therefore, apply 
the maximum entropy principle. The resulting states are called maximum- entropy 
packets, ME packets. The averages of coordinates and momenta take over the role 
of coordinate and momenta in classical mechanics. In any case these averages rep- 
resent measurable aspects of these variables. The dynamical evolution of variances 
is an important indicator of the applicability of the model one is working with. It 
determines the time intervals within which reasonable predictions are possible. 

Consider a three-body system that is to model the Sun, Earth and Jupiter in 
Newton mechanics. It turns out that generic trajectories starting as near to each 
other as, say, the dimension of the irregularities of the Earth surface will diverge 
from each other by dimensions of the Earth-Sun distance after the time of only 
about 10 7 years. This seems to contradict the 4 x 10 12 years of relatively stable 
Earth motion around the Sun that is born out by observations. The only way out 
is the existence of a few special trajectories that are much stabler than the generic 
ones and the fact that bodies following an unstable trajectory have long ago fallen 
into the Sun or have been ejected from the solar system. This spontaneous evolution 
can be considered as a preparation procedure of solar system. 
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As explained in Sec. 1.1.2, a prepared state can be viewed as objective. A simple 
example is a gun in a position that is fixed in a reproducible way and that shoots 
bullets using cartridges of a given provenance. The state of each individual shot is 
defined by the conditions and is the same for all shots even if observations may have 
different results for different shots. In the theoretical description of a state, we can 
make the limit of AQ k — > 0, — > 0. This is considered as a non-existing, but 
practically useful idealization. 

A finer analysis is possible only as long as new preparations are specified that 
determine a subset of individual shots. For example, if the conditions on the gun 
are that the gun is held by a human hand, then one cause of the relatively large 
AQ k and AP k is that the gun is always in a slightly different position because 
the hand and eye are not completely sure. The cause is 'classical' in its nature 
and one can imagine that each shot will be repeated several times more under the 
condition of a mechanical fixation. Then, there will still be variances but they may 
be much smaller. Thus, the original preparation can be considered as composed of 
sub-preparations in the way of gemenge. 

In general, if a part of the variances in a state come about as random or controlled 
changes of classical conditions in the preparation procedure, then the preparation is 
clearly composed of sub-preparations and the state a gemenge. Some variances will 
still remain, but they will be 'sufficiently' small for the corresponding states to be 
viewed as defined by the corresponding averages and negligible variances. 

To limit ourselves just to given averages and variances of coordinates and mo- 
menta is a great simplification that enables us to obtain interesting results easily. 



Let us first consider a system S with one degree of freedom (the generalization to 
any number is easy). Let the coordinate be q and the momentum p. A state is a 
distribution function p(q,p) on the phase space spanned by q and p. The function 
p(q,p) is dimensionless and normalized by 



where v is an auxiliary phase-space volume to make the integration dimensionless. 
The entropy of p(q,p) can be defined by 



The value of entropy will depend on v but most other results will not. Classical 
mechanics does not offer any idea of how to fix v. We shall get its value from 
quantum mechanics. 



5.4 Classical ME packets 
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5.4.1 Definition and properties 

Definition 21 ME packet is the distribution function p that maximizes the entropy 
subject to the conditions: 

(q) = Q, (q 2 ) = AQ 2 + Q\ (5.18) 

and 

(p) = P , (p 2 ) = AP 2 + P 2 , (5.19) 
where Q, P, AQ and AP are given values. 
We have used the abbreviation 

f dqdp 

(x) = / xp . 

J v 

The explicit form of p can be found using the partition- function method [30] . The 
resulting partition function for classical ME packets is 

Z=-^=exv(^ + 4-) ■ (5-20) 

The four Lagrange multipliers Ai, A 3 , A 2 and A 4 correspond to the four conditions 
(EEBD and (ESD - 

The generalization to any number of dimensions is trivial. 

Proposition 19 The distribution function of the ME packet for a system with given 
averages and variances Q\, • • • , Q n , AQx, • • • , AQ n of coordinates and Pi, ■ ■ ■ , P n , 
APi, • ■ ■ , AP n of momenta, is 

(Qk - Qk) 2 {Pk ~ Pk) 



2AQ 2 2AP t 



(5.21) 



For proof, see Ref. |30j. We observe that all averages obtained from p are indepen- 
dent of v and that the result is a Gaussian distribution in agreement with Jaynes' 
conjecture that the maximum entropy principle gives the Gaussian distribution if 
the only conditions are fixed values of the first two moments. 

As AQ and AP approach zero, p becomes a delta-function and the state becomes 
sharp. For some quantities this limit is sensible for others it is not. In particular, 
the entropy, which can easily be calculated, 

s = i + u 2 " AQAP , 

V 

diverges to — oo. This is due to a general difficulty in giving a definition of entropy 
for a continuous system that would be satisfactory in every respect. What one could 
do is to divide the phase space into cells of volume v so that AQAP could not be 
chosen smaller than v. Then, the limit AQAP — > v of entropy would make more 
sense. 
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5.4.2 Classical equations of motion 

Let us assume that the Hamiltonian of S has the form 

2 

H = h + v{q) ' (5,22) 

where fi is the mass and V(q) the potential function. The equations of motion are 

Q = U,H}, p={p,H}. 
Inserting ( I5.22p for H, we obtain 

4=5, P=~- (5-23) 
\i dq 

The general solution to these equations can be written in the form 

q{t) = q(t; q, p) , p{t) = p(t; q, p) , (5.24) 

where 

q(0;q,p) = q, p(0;g,p)=p, (5.25) 

q and p being arbitrary initial values. We obtain the equations of motion for the 
averages and variances if the initial state is an ME packet: 

Q(t) = (q(t;q,p)) , AQ(t) = a/ ((q(t; q,p) - Q(t)) 2 } (5.26) 

and 

P(t) = {p(t;q,p)} , AP(t) = y/((p(t;q,p)-P(t))*) . (5.27) 

In general, Q(t) and P(t) will depend not only on Q and P, but also on AQ and 
AP. 

Let us consider the special case of at most quadratic potential: 

V(q) = V Q + V iq + l -V 2 q\ (5.28) 

where Vk are constants with suitable dimensions. If V\ = Vi = 0, we have a free 
particle, if V2 = 0, it is a particle in a homogeneous force field and if V2 7^ 0, it is an 
harmonic or anti-harmonic oscillator. 

In this case, the general solution has the form 

q(t) = f (t) + qh(t)+pf 2 (t) , (5.29) 
p(t) = g Q (t)+qg 1 (t)+pg 2 (t) , (5.30) 
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where / (0) = / 2 (0) = g (0) = <?i(0) = and ^(0) = g 2 (0) = 1. If V 2 ± 0, the 
functions are 

Vi 1 

fo(t) = - — (1 -cosut) , = coswt , f 2 (t) = -smut , (5.31) 

V 2 K 

V\ 

go(t) = — £— sincut , gi(t) = — ^sinwt , g 2 (t) = cosut , (5.32) 
V 2 

where 

% 

Only for V 2 > 0, the functions remain bounded. If V 2 = 0, we obtain 



fo(t) = -^t\ j\(t) = l, f 2 (t) = -, (5.33) 

g (t) = -Vit, gi(t) = 0, g 2 (t) = l. (5.34) 

The equations for averages and variances resulting from Eqs. (15.241) . ( I5.18P and 
fl5TT9|) are [30] 

Q(t) = f (t)+Qf 1 (t)+Pf 2 (t), (5.35) 



AQ(f) = y/fKt)AQ 2 + /|(t)AP 2 , (5.36) 

P(t) = g Q (t) + Q 9l (t) + Pg 2 (t) , (5.37) 
AP(t) = yJft(t)AQ* + g*(t)AP*. (5.38) 

We observe: if functions ^2(^)5 fi'i(^) an d 92(f) remain bounded, the variances 

also remain bounded and the predictions are possible in arbitrary long intervals of 
time. Otherwise, there will always be only limited time intervals in which the theory 
can make predictions. 

In the case of general potential, the functions (I5.24p can be expanded in products 
of powers of q, p and t, and the averages of these products can be calculated by 
partition function method. In Ref. [SU], equations of motion are calculated up to 
fourth order. 



5.5 Quantum ME packets 

Let us now turn to quantum mechanics and try to solve an analogous problem. 
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5.5.1 Definition and properties 

Definition 22 Let the quantum model S q of system S has the basic observables q 
and p. State T that maximizes von Neumann entropy (see Sec. 3.1) 

S = -tr(T\nT) (5.39) 

under the conditions 

£r[Tq] = Q , tr[Tq 2 ] = Q 2 + AQ 2 , (5.40) 

ir[Tp] = P , tr[Tp 2 } = P 2 + AP 2 , (5.41) 
where Q, P, AQ and AP are given numbers, is called quantum ME packet. 

To calculate the state operator of a ME packet, one can use the partition func- 
tion method. However, it must be modified because the involved quantities do not 
commute, see Ref. [30]. The partition function for the quantum ME packets is 

exp (i| + i|) 

z = — v > -\ ■ ( 5 - 42 ) 

2 sinh(/j,vA 3 A4) 

The resulting state operator, generalised to n degrees of freedom is described by 
the following 

Proposition 20 The state operator of the ME packet of a system with given av- 
erages and variances Qi, ■ • • , Q n , AQx, • • ■ , AQ n of coordinates and P\, ■ ■ • , P n , 
APi, • ■ ■ , AP n of momenta, is 



t = II 



k=l 

where 



— 7 exp -- In -K k 

v v{-\ \ h v k - 1 



(5.43) 



K fc ^||(q fc -g fc ) 2 + ^( Pfc -P fe ) 2 (5.44) 
and 

2AP k AQ k 

v k = . (5.45) 

n 



For the proof, see Ref. [30J. The calculation of the state operator is simplified by the 
fact that K k is Hamiltonian operator of a harmonic oscillator. Of course, system S q 
for which the calculation is done, is completely general and its Hamiltonian operator 
has nothing to do with K k . 

Strictly speaking, the state operator (I5.43P is not a Gaussian distribution. Thus, 
it seems to be either a counterexample to, or a generalization of, Jaynes' hypothesis. 
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Everything can be decomposed into well-known eigenstates \m) of K. We have 
(n = l) 

oo 

K = J2 R m\m)(m\ . (5.46) 

fc=0 

We easily obtain for R m that 

(v - l) m 

R m = 2 / . (5.47) 

(z/ + l) m+1 



Hence, 



lim i? m — 5 m o 



and the state T becomes |0)(0|. In general, states \m) depend on v. The state vector 
|0) expressed as a function of Q, P, AQ and v is given by 



1 v 



vr2AQ 



1/4 



(g-Q) 2 + 



4Ag 2 ^ ^ y ft 



(5.48) 



This is a Gaussian wave packet that corresponds to other values of variances than 
the original ME packet but has the minimal uncertainty. For v — y 1, it remains 
regular and the projection |0)(0| becomes the state operator of the original ME 
packet. Hence, Gaussian wave packets are special cases of quantum ME packets. 

5.5.2 Quantum equations of motion 

Let the Hamiltonian of S q be H and the unitary evolution group be U(t). The 
dynamics in the Schrodinger picture leads to the time dependence of T: 



T(t) = U(t)TU(t) 



Substituting for T from Eq. f!5.43j) and using a well-known property of exponential 
functions, we obtain 

T(t) = -A T exp(-iln^±ju(t)KU(t)^ . (5.49) 

In the Heisenberg picture, T remains constant, while q and p are time dependent 
and satisfy the equations 

ift^ = [q,H], iA^=[p,H]. (5.50) 

They are solved by 

q(t) = U(f)+qU(f) , p(f) = U(f) f pU(t) , 
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where q and p are the initial operators, q = q(0) and p = p(0). The resulting 
operators can be written in the form of operator functions analogous to classical 
expressions (15.241) so that Eqs. (I5.26|) and (15.271) can again be used. 

The example with potential function ( 15. 28 ft is solvable in quantum theory, too, 
and we can use it for comparison with the classical dynamics as well as for a better 
understanding of the ME packet dynamics. Eqs. (15.501) have then the solutions 
given by (I5.29j) and (15.301) with functions f n (t) and g n (t) given by (I5.3ip and (I5.32j) 
or ( 15.331) and ( 15.341) . The calculation of the averages and variances is analogous to 
the classical one and we obtain Eqs. (15.351) and Eq. (I5.36H again. Similarly for p, 
the results are given by Eqs. (15.371) and (15.381) . 

We have shown that the averages and variances of quantum ME packets have 
exactly the same time evolution as those of classical ME packets in the special case 
of at-most-quadratic potentials. From formulae (I5.36P and (I5.38f) we can also see an 
interesting fact. On the one hand, both variances must increase near t = 0. On the 
other, the entropy must stay constant because the evolution of the quantum state 
is unitary. As the relation between entropy and v is fixed for ME packets, the ME 
packet form is not preserved by the evolution (the entropy ceases to be maximal). 
This is similar for Gaussian-packet form or for coherent-state form. 

For general potentials, there will be two types of corrections to the dynamics of 
the averages: terms containing the variances and terms containing fi. To see these 
corrections, equations of motion have been expanded in powers of q, p and t in Ref. 
|30j . The conclusion has been that the quantum equations begin to differ from the 
classical ones only in the higher order terms in V or in the higher time derivatives 
and the correction is of the second order in 1/V. This seems to be very satisfactory: 
our quantum model reproduces the classical dynamic very well. Moreover, Eq. (I5.48P 
shows that Gaussian wave packets are special cases of ME packets with v — 1. Thus, 
they approximate classical trajectories less accurately than ME packets with large 

5.6 Classical limit 

Let us now look to see if our equations give some support to the statement that 
v 3> 1 is the classical regime. 

The quantum partition function ( I5.42[) differs from its classical counterpart (I5.20p 
by the denominator sinh(^v . If 

h^/X^^l, (5.51) 

we can write 

sinh^v/^Al) = ft\/A^[l + 0((h^/h^) 2 )} 
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The leading term in the partition function then is 

r7 _TT 1 ( \\ A|\ 

where h = 2nh. Comparing this with formula (I5.20p shows that the two expressions 
are identical, if we set 

v = h . 

We can say that quantum mechanics gives us the value of v. Next, we have to 
express condition f)5.5ip in terms of the averages and variances. The usual relations 
between the Lagrange multipliers on the one hand and the averages and variances 
on the other (see Ref. [30]) imply 

= 7: In V -^~ ■ 
2 v — 1 

Hence, condition (15.51 j) is equivalent to 

v > 1 . (5.52) 

The result can be formulated as follows. Classical mechanics allows not only 
sharp, but also fuzzy trajectories and the comparison of some classical and quantum 
fuzzy trajectories shows a very good match. The fuzzy states chosen here are the 
so-called ME packets. Their fuzziness is described by the quantity v = 2AQAP/ h. 
The entropy of an ME packet depends only on v and is an increasing function of 
it. The time evolution of classical and quantum ME packets with the same initial 
values of averages and variances defines the averages as time functions. The larger v 
is, the better the quantum and the classical evolutions of average values have been 
shown to agree. Thus, the classical regime is neither AQ = AP = (absolutely 
sharp trajectory) nor v = 1 (minimum quantum uncertainty). This is the most 
important result of Ref. [SU] • The time functions coincide for the two theories in the 
limit v — > 00. Hence, in our approach, this is the classical limit. It is very different 
from the usual assumption that the classical limit must yield the variances as small 
as possible. Of course, v can be very large and still compatible with classically 
negligible variances. 

One also often requires that commutators of observables vanish in classical limit. 
This is however only motivated by the assumption that all basic quantum prop- 
erties are single values of observables. Within our interpretation, this assumption 
is rejected and if classical observables are related to quantum operators then only 
by being average values of the operators in prepared states. Then, first, all such 
averages are defined by a preparation and do exist simultaneously, independently 
of whether the operators commute or not. For example, Q and P are such simul- 
taneously existing variables for ME packets. Second, a joint measurement of fuzzy 
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values of non-commuting observables is possible. In Sec. 4.4, we have shown Eq. 
(14. 9p . which is a general formula valid for an arbitrary state of the ancilla. Then, 
we have chosen T^ to be a Gaussian wave packet, which has v = 1. However, T_4 can 
also be chosen as a quantum ME packet with Q = 0, P = and arbitrary AQ and 
AP allowing arbitrary large v. The resulting observable represents the detection of 
a shifted and boosted ME packet. 

Let us compare the present paper notion of classical limit with a modern textbook 
version such as Chap. 14 of [3]. Both approaches define the classical limit of a 
quantum state as a classical ensemble described by a fuzzy distribution function 
and calculate time evolutions of averages in the states. However, in the textbook, 
any quantum system, even not macroscopic, and any state, even pure, are allowed 
(pure states are preferred as they have smaller uncertainties). Hence, our notion is 
much narrower: we consider only macroscopic quantum systems and only some of 
their maximum entropy states. This has obvious physical reasons explained in Sees. 
5.1 and 5.3. 
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Chapter 6 



Quantum models of preparation 
and registration of microsystems 

Discussions about the nature of quantum measurement were started already by 
founding fathers of quantum theory, persisted throughout and seem even to amplify 
at the present time. 

An old approach to the problem of quantum measurement is Bohr's (its newer, 
rigorously reformulated version is Ref. [IE])- This approach denies that measur- 
ing apparatuses, and all classical systems in general, are quantum systems in the 
sense that all their properties can be derived from, or are compatible with, quan- 
tum mechanics. They must be described by other theories, called pretheories. Of 
course some classical properties of macroscopic systems can be obtained by quantum 
statistics. Ref. [49J shows that such occasional applications of quantum mechanics 
to classical systems are compatible with the form of denying the universality of 
quantum mechanics specified there. 

Modern approaches assume the universality of quantum mechanics together with 
various further ideas. A very brief account of the three most important examples is: 

1. Quantum decoherence theory [751 [76]. The idea is that system S+A composite 
of a quantum object and an apparatus cannot be isolated from environment 
£. Then the unitary evolution of S + A + £ leads to a non-unitary evolution of 
S + A that can erase all correlations and interferences from S + A that hinder 
the objectification (see the Introduction). However, then the correlations and 
interferences are just transferred to S + A + £ and the method can only work, 
if some bizarre assumptions are made about world, for instance one accepts 
some variant of the many world interpretation. 

2. Superselection sectors approach [34j [62]. Here, classical properties are de- 
scribed by superselection observables of A which commute with each other 
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and with all other observables of it (see Sec. 3.3). Then, the state of A after 
the measurement is equivalent to that with a suitable gemenge structure. The 
problems are that, first, the Hamiltonian of the measurement coupling can- 
not be an observable or else the measurement model does not work. Second, 
the only known physical mechanism for producing strict superselection rules 
occurs in systems having infinitely many degrees of freedom. However, mea- 
suring apparatuses do not satisfy this conditions and the superselection rules 
can therefore be only approximative. 

3. Modal interpretation [15]. One assumes that there is a subset of projections 
that have determinate values in the state (of S + A) after the registration. 
The set of projections must just be such that it does not violate contextuality 
(see Sec. 3.2.2), even if it can violate the existence of correlations and inter- 
ferences. Thus, one must postulate that the measurements of the correlations 
and interferences on the state are forbidden for some unknown reason. 

In general, the problem is far from being satisfactorily solved by any of the modern 
theories. Analysis of Refs. [22], [151 IS] > as we U as °f our previous papers [291 EQ] , give 
an account of their shortcomings. We adopt the definition of the problem and the 
proof that it is far from being solved from Ref. [IE], Chap. IV, where the above short 
points are dealt with in more detail and the corresponding references are given. 

Our approach to the problem is very different from anything that has ever been 
published in the field. It is based on a physical idea rather than on some spectacular 
mathematics. It seems to promise a complete solution to the problem. To explain 
the idea, we make some simplifying assumptions. First, we restrict ourselves to 
measurements performed on microsystems, i.e. particles and systems composite of 
few particles. There are other systems on which recently a lot of interesting ex- 
periments has been done, such as Bose-Einstein condensates, strong laser beams or 
currents in superconducting rings. Such quantum states of 'large' systems, mezo- 
scopic or even macroscopic, will be ignored here. Second, we use a simplified model 
of measurement. 

6.1 Beltrametti-Cassinelli-Lahti model 

In this section, we are going to recapitulate the well-known ideas on measurement 
that will be needed later. A summary is |16| . p. 25: 

. . . the object system S, prepared in a state T is brought into a suitable 
contact — a measurement coupling — with another, independently prepared 
system, the measurement apparatus from which the result related to the 
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measured observable is determined by reading the value of the pointer 
observable. 

In Ref . [16] , these ideas are developed in detail with the help of models. One of them, 
which we call Beltrametti-Cassinelli-Lahti (BCL) model (p. 38), is as follows. Let 
a discrete observable of system S of type r with Hilbert space H T be measured. 
Let Ok be eigenvalues and {<fi k j} be the complete orthonormal set of eigenvectors of 
0, 

04>kj = o k 4>kj ■ 

We assume that k = 1, • • • , N so that there is only a finite number of different 
eigenvalues o k . This is justified by the fact that no real registration apparatus can 
distinguish all elements of an infinite set from each other. It can therefore measure 
only a function of an observable that maps its spectrum onto a finite set of real 
numbers. Our observable is such a function. The projection E° on the eigenspace 
of o fc is then E£ = £\ \<p kj ) (<p kj \. 

Let the registration apparatus^! be a quantum system A with Hilbert space 
and an observable A. Let A be a non-degenerate, discrete observable with the same 
eigenvalues o k and with orthonormal set of eigenvectors ip k , 

Aip k = o k ip k 

with possible further eigenstates (such as ip °f Prop. 21) and eigenvalues. The 
projection on an eigenspace is E£ = \ip k ){ijj k \. A is the so-called pointer observable. 

Let the measurement start with the preparation of S in state T and the indepen- 
dent preparation of A in state T4. The initial state of the composed system S + A 
is thus T <S> T4. 

Let S and A then interact for a finite time by the so-called measurement cou- 
pling and let the resulting state be given by U(T <g) T^Ut, where U is a unitary 
transformation on H T <g> H^. 

The final state of the apparatus is Hs [U(T®T^)U^] , where Us is the partial trace 
over states of S. The first requirement on the model is that this state gives the same 
probability measure for the pointer observable as the initial state T predicted for 
the observable 0: 

tr[TE°]=tr[n <s [U(T®T^)Ut]E^] . 
This is called probability reproducibility condition. Now, there is a theorem [8]: 

Proposition 21 Let a measurement fulfil all assumptions and conditions listed 
above. Then, for any initial vector state ip of A, there is a set {(p k i} of unit vectors 

our language, a measurement consists of preparation and registration so that what Ref. [16] 
often calls 'measurement' is our 'registration'. 
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in H r satisfying the orthogonality conditions 



(VkilVkj) = fyj (6-1) 
such that U is a unitary extension of the map 

<pki ( S>i ! <-Pki ® 4>k ■ (6.2) 

The second requirement on the model is that it has to lead to a definite result. 
More precisely, the apparatus must be in one of the states \itk)(i^k\ after each indi- 
vidual registration. This is called objectification requirement^. Ref. [16] introduces 
a more general concept of measurement that leaves open whether the objectifica- 
tion requirement is satisfied or not. Such a procedure is called premeasurement. A 
measurement is then a premeasurement that satisfies objectification requirement. 

Suppose that the initial state of S is an eigenstate of 0, T = \(pki)(4>ki\, with the 
eigenvalue o*.. Then, Eq. (16. 2 j) implies that the final state of apparatus A is \tpk)(ipk\, 
and the premeasurement does lead to a definite result. However, suppose next that 
the initial state is an arbitrary vector state, T = \<f))(<f)\. Decomposing <p into the 
eigenstates, 

we obtain from Eq. (16. 2\\ 

$end = U(0®V) = ^VPhfl^^k , (6.3) 

k 

where 

= ; E ' CWyfc ' _ (6.4) 

(Si Y.j C kj¥kj) 



and 



is the probability that a registration of performed on vector state 4> gives the value 
Ok- The final state of apparatus A then is 

*r 5 [U(T® T^ut] = J2Vp~kVpi^MMk)(^i\ - (6.5) 



2 Ref. 16 calls this pointer objectification and introduces a further requirement concerning the 
state of S after the registration, value objectification (pp. 30, 49). As our approach denies that 
objectivity of observable values is necessary for realist interpretation, the value objectification is 
not an important requirement for us, but it will follow from the pointer objectification via strong 
correlations between observable and pointer values valid in some models. 
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Because of the orthonormality of |^fe)'s, the probability that the apparatus shows 
the value if A is registered on it in this final state is pk, which is what the 
probability reproducibility requires. However, if the objectification requirement is 
to be satisfied, the final state of the apparatus must be 

tr 5 [U(T®T^)Ut] = (j^j Pj \^){^\ (6.6) 

(it must be a gemenge, see Chapter 1, Sec. 1.2). Thus, in general we have a model 
of a premeasurement, but not of a measurement. 

6.1.1 Repeatable premeasurement and von Neumann model 

In order to define what a repeatable premeasurement is, we need the notion of state 
transformer. To this aim, let us first calculate the final state of system S after a 
BCL premeasurement is finished: 

k 

The part of the sum on the right-hand side corresponding to the result of premea- 
surement lying in the set X is 

X(X)(|0>(0|)= X>|rf>(rf| ■ (6-7) 

o k £X 

The right-hand side is not a state, because it is not normalised. Its trace is the 
probability that the result lies in X, 

p°(X) = tr[l(X)(T)} 

if the initial state of S is T. The quantity (l/pj(X))I(X) is an operation-valuecjfl 
measure and is called state transformer. For more details, see Ref. [T6] . 

Definition 23 A premeasurement is called repeatable if its state transformer sat- 
isfies the equation 

tr[l(Y)(X(X)(T))} = tr[l(Y n X)(T)] (6.8) 
for all subsets of possible values X and Y and all possible states T of S. 

That is, the repetition of the premeasurement on S does not lead to any new result 
from the probabilistic point of view. To see whether the state transformer (16. 7p 
satisfies Eq. (I6.8p . let us rewrite in the form: 

i{x){\4>){4>\) = pMHvW = E K ^>^ K l > 

o k ex o k ex 



Operation J is a map, J : T(H T )+ ^ T(H T )+, see Ref. [TB]. 
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where 

K fc = E \Pki)(4>ki\ ■ 

I 

One can show that this relation is general, 

X(X)(T) = K fc TKl , 

o k ex 

for proof, see Ref. [16]. We have then 

X(F)(X(X)(T)) = £ K, ( K fc TK iU Kj = E (K.K fc )T(K,K,)t . 
Eq. (16.81) would be satisfied if 

K^K fc = Ka^ , (6.9) 

which is in general not the case. 

Let us therefore restrict ourselves to measurement couplings satisfying 

4>m = ■ (6.10) 

This model is called von Neumann premeasurement because it was first described 
in Ref. pP - 

For von Neumann premeasurement, the operator is the projection Ejr? on the 
eigenspace of o^, 

i 

and Eq. (16. 9 j) is satisfied. Thus, von Neumann premeasurement is a special case of 
rep eat able premeasurement . 

The vector states tp\ given by Eq. (16. 4p are orthonormal for von Neumann pre- 
measurements. Thus, the final state of the apparatus given by Eq. (16. 5p reduces 
to (16. 6p . This is, however, not sufficient for the objectification requirement to be 
satisfied. The right-hand side of Eq. (16. 6p must be the gemenge structure of the 
state. If so, then Rule 11 implies that the final state of the composite system S + A 
must be 

i 

where T'- are some states of S. But $ end of (16. 3 p is a vector state. State operators of 
vector states admit only trivial decomposition. Hence, the objectification require- 
ment is not satisfied for von Neumann premeasurements, and it is therefore not a 
measurement. 

4 In fact, von Neumann premeasurement is slightly more general in the sense that it is a pre- 
measurement of a function /(O), where / need not be bijective, cf. Ref. [IB] . 
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An analogous difficulty holds for more general models of premeasurement de- 
scribed in Ref. [16] and the book contains more general no-go theorems. This is 
called problem of obj edification. The conclusion of Ref. [16] is that the von Neu- 
mann model of premeasurement, together with its generalisations, does not work as 
a model of measurement. Von Neumann himself postulated that measurements de- 
fine another, non-unitary and indeterministic kind of evolution in which S randomly 
jumps into one of the eigenstates of the measured observable (Ref. [51] . pp. 217, 
351). This was called collapse of the wave function by Bohm (Ref. [ID] , p. 120). 

6.2 Comparison with real experiment. 
Importance of detectors 

The theoretical models of the previous section ought to describe and explain at least 
some aspects of real experiments. This section will try to go into all experimental 
details that can be relevant to our theoretical understanding. 

First, we briefly collect what we shall need about detectors. Microsystem S to 
be detected interacts with the sensitive matter of the detector so that some part of 
energy of S is transferred to the detector. Mostly, S interacts with some subsystems 
of the sensitive matter exciting each of them because the excitation energy is much 
smaller than the energy of S. The resulting subsystem signals are collected, or 
amplified and collected so that they can be distinguished from noise. For example, 
in ionisation detectors, many atoms or molecules of the sensitive matter are turned 
into electron-ion pairs. If the energy of S is much higher than the energy of one 
ionisation, say about 10 eV, then many electron-ion pairs are produced and the 
positive as well as the negative total charge is collected at electrodes [15] . 

In the so-called cryogenic detectors |68j, S interacts, e.g., with superheated super- 
conducting granules by scattering off a nucleus in a granule and the phase transition 
from the superconducting into the normally conducting phase of only one granule 
can lead to a perceptible electronic signal. A detector can contain very many gran- 
ules (typically 10 9 ) in order to enhance the probability of such scattering if the 
interaction between S and the nuclei is very weak (weakly interacting massive par- 
ticles, neutrino). Modern detectors are constructed so that their signal is electronic. 
For example, to a scintillating matter, a photomultiplier is attached, etc., see Ref. 
H5J. 

In any case, in order to make a detector respond S must lose some of its energy 
to the detector. The larger the loss, the better the signal. Thus, most detectors 
are built in such a way that S loses all its kinetic energy and is absorbed by the 
detector (in this way, also its total momentum can be measured). Let us call such 
detectors absorbing. If the bulk of the sensitive matter is not large enough S can 
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leave the detector after the interaction with it, in which case we call the detector 
non-absorbing. Observe that a detector is absorbing even if most copies of S leave 
the detector without causing a response but cannot leave if there is a response (e.g., 
neutrino detectors). 

Suppose that S is prepared in such a way that it must cross a detector. Then, 
the probability of the detector response is generally rj < 1. We call a detector ideal, 
if rj = 1. 

An important assumption, corroborated by all experiments, is that a real de- 
tector either gives a signal or remains silent in each individual registration. This 
corresponds here to the objectification requirement. 

After these preparatory remarks, consider a kind of repeatable premeasurement 
that is often described in textbooks (see, e.g., Ref. |56j, p. 27, where it is called 
'repeatable test'), for example a Stern- Gerlach-like measurement of spin. Let co- 
ordinate system {x 1 ,^ 2 ,^ 3 } be chosen. Silver atoms evaporate in an oven O, form 
a beam B along x 2 -axis passing through a velocity selector V, and then through 
an inhomogeneous magnetic field produced by device A\\. Aii splits -Bo into two 
beams, B\ + and -Bi_, of which B\ + is associated with positive and B\_ with nega- 
tive spin x 1 -component, the corresponding vector states being denoted by |1+) and 
|1— ). Beam B\- is blocked off by a screen. This is the preparatory part of the 
experiment. 

Next, beam S 1+ runs through another magnetic device, Ai\p with centre at xm 
and finally strike an array of ideal detectors {.4.^} placed and oriented suitably 
with respect to Ai^ . Two detectors of array {A^ } respond, let us denote them 
by A + and A-, revealing the split of Bi + into two beams, -83+ and -B3-, caused by 
Ai^ . Let the orientation of Ai^ be such that B 3+ corresponds to positive and -B 3 _ 
to negative spin a; 3 -component, the associated states of silver atoms being |3+) or 
|3— }. The beams B 3+ and B 3 _ are spatially sufficiently separated when they arrive 
at the detectors so that they hit different detectors A+ and A-. In this way, the 
values of the quantum mechanical centres of mass of silver atoms at the moment of 
their interaction with the detectors are measured. These centres of mass denoted by 
X3 + and X3_ are registered by the detectors in a rather coarse-grained way, which 
is, however, perfectly sufficient to determine the spins of the atoms. Pointer values 
are the electronic responses of detectors A + and A_. Let us call experiment I what 
is performed by O, V, Mi, M*p and {A^}. 

Let us now remove {.4.jj. }, place device A4 3 of the same macroscopic structure 

and orientation as M^ with centre position a?™ in the way of B% + so that £> 3 _ 

(2) 

passes by and arrange array {A k } so that it has the same relative position with 
respect to AI3 as {AfP } had with respect to Ai^ . Now, only one detector will 
respond, namely that which is approximately at the position x 3+ — + X( 2 ). Let 
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us call experiment II what is performed by O, V, Ai\, Ai 3 , Ai 3 and {A k }. The 
result of experiment II is described in Ref. [SB] as 'two consecutive identical tests 
following each other with a negligible time interval between them ... yield identical 
outcomes'. 

Clearly, experiment II does not consist of two copies of experiment I performed 
after each other. The only repetition is that device Ai^ is placed after Ai 3 and 
has the same structure and orientation with respect to its incoming beam B 3+ as 
Ai 3 has with respect to B\ + . Device Ai^ splits B x+ into -83+ and B 3 _ but Ai z 
does not split B 3+ . One may say that it leaves B 3+ unchanged. Let us define the 
action of device Ai^ together with the choice of (±)-beam for each k = 1, 2 as a test 
(in the sense of Ref. [SB]) or a premeasurement. Let the outcomes be the thought 
responses of imaginary detectors placed in the way of the chosen beam. Then the 
(counterfactual) outcomes can be assumed to be identical indeed and we have an 
example of repeatable premeasurement that satisfies Definition 22. 

The procedures defined in this way are premeasurements that can be described 
by von Neumann model. The value of the spin is transformed by the magnetic 
field into the value x 3+ of its centre-of mass-positions that can be considered as the 
eigenvalues of the pointer observable associated with effects |3±)(3 ± |. However, 
the premeasurement cannot be considered as an instance of registration because it 
does not give us any information about the silver atoms. Try to suppose, e.g., that 
the arrangement measures effects |3±)(3 ± | depending on which of the outgoing 
beams is chosen. Now, how can we recognise whether the outcome is 'yes' or 'not'? 
There is no change of a classical property of an apparatus due to its interaction 
with a microsystem that would indicate which of the values x 3+ and x 3 _ results. 
But premeasurement is allowed not to give definite responses by each individual 
action. To obtain definite values, additional detectors are needed. Without the 
additional detector, however, this real premeasurement is not a measurement. 

The above observations about the role of detectors seem to be obvious. It is a 
strange fact, however, that the detectors are neglected or deemed irrelevant in old 
as well as in modern theoretical accounts of quantum measurement. For example, 
Ref. [55], p. 64, describes a measurement of energy eigenvalues with the help of 
scattering similar to Stern-Gerlach experiment, and it explicitly states: 

We can consider the centre of mass as a 'special' measuring apparatus... 

Similarly, Ref. [SB], p. 17 describes Stern-Gerlach experiment: 

The microscopic object under investigation is the magnetic moment fi 
of an atom.... The macroscopic degree of freedom to which it is coupled 
in this model is the centre of mass position r... I call this degree of 
freedom macroscopic because different final values of r can be directly 
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distinguished by macroscopic means, such as the detector... From here 
on, the situation is simple and unambiguous, because we have entered 
the macroscopic world: The type of detectors and the detail of their 
functioning are deemed irrelevant. 

In our opinion, this notion of measurement is erroneous and leads directly to von 
Neumann model and to the problem of quantum measurement. 

Suppose next that there are non-absorbing ideal detectors that do not disturb the 
spin state of the atom. This might work, at least approximately. Let experiment I' 
be the same as I with the only change that the array {.4.^} is replaced by {*4^ 1 ' ) } 
containing the non-absorbing detectors. Let experiment IF starts as F and proceeds 
as II but with {*4.j^} replaced by {*4| < ' 2 ' ) } made from the non-absorbing detectors. 
Clearly, the action of iM 3 + {A p k ^}) for each j — 1, 2 is a repeatable premeasure- 
ment according to Definition 22, and it is even a repeatable measurement because 
of the responses of the real detectors, but it definitely cannot be described by a von 
Neumann theoretical model. For the detectors to respond, some part of the energy 
of the atoms is needed, so that condition (I6.1(jp is not satisfied. 

An interesting difference emerges here between what we can say about the system 
(silver atom) on the one hand and about states on the other in their relation to the 
beams B 3+ and B 3 _. Whereas B 3+ is associated with |3+) and £> 3 _ with |3— ), each 
atom is in a linear superposition of the two states |3+) and |3— ) that equals to the 
prepared state One cannot even say that all atoms in beam B 3+ are in state 

1 3+) because no atom is just in B 3+ . Unlike the states, the atoms are not divided 
between the beams. Indeed, the two beams could be guided so that no detectors are 
in their two ways and that they meet each other again. Then, they would interfere 
and if the two ways are of equal length, so that no relative phase shift results, the 
original state |1+) would result. This would happen even if the beams are very thin, 
containing always at most one silver atom. Hence, each atom in its wave aspect had 
to go both ways simultaneously. 

Let us however observe that each of the beams B 3+ and B 3 _ by itself behaves as if 
it were a prepared beam of silver atoms in a known state, which are |3+) and |3— ), 
respectively. The fact that we place some arrangement A of devices that do not 
contain any detector in the way of beam B 3+ and leave B 3 _ alone justifies our use of 
state in |3+) in all calculations of what will be the outcome after arrangement A is 
passed. The voluntary element of beam choice in this experiment can be interpreted 
as a preparation or as a reselection of ensemble, but not as a collapse of the wave 
function. Indeed, the whole outcome will be a linear superposition of states in each 
of the two beams at the time the upper beam passes A. Only if we put any detector 
after A or, for that matter, a detector or just a screen into the way of -B3-, then 
something like a collapse of the wave function can happen. 
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As yet, we have considered only the so-called direct registrations. A complete 
theory of registration must distinguish two kinds of registration. 

A direct registration first manipulates S by classical fields and screens so that the 
prepared beam is split into spatially separated beams, each of which associated with 
one eigenvalue of the registered observable. Then, there is a set of detectors each of 
which can be hit by only one beam. 

An indirect registration, such as scattering or QND measurement (see e.g. [13] 
and [56] p. 400), lets S interact with an auxiliary microsystem S[ and it is only 
S[ that is then subject to a direct registration. For the measurement to be QND, 
several further conditions must be satisfied, but this does not concern us here. After 
a QND procedure, S remains available to another one: another system S' 2 of the 
same type as S[ interacts with S and is then directly registered etc. Information 
given by the detectors of the direct registrations reveals also something about S. 
Thus, detectors are necessary for indirect registrations, too. 

The analysis of the present section motivates the following generalisation. First, 
an arrangement of devices that acts in agreement with von Neumann model of direct 
premeasurement is not a registration apparatus. Second: 

Assumption 6 Any registration apparatus for microsystems must contain at least 
one detector and every reading of a pointer value is a signal from a detector. 

Assumption 6 seems to be obvious but it has far reaching consequences. 

As explained in Sec. 2.2.4, a preparation of a quantum object S of type r must 
separate it from other systems of type r in a space domain D. Assumption 6 implies, 
however, that this need not be sufficient for S to be an object on which registrations 
can be performed. Indeed, on the one hand, any direct registration first manipulates 
S individually by fields and screens and then detect it by a detector. For example, 
the scattering of electrons on a crystal of graphite can be described as manipulating 
each electron by an effective potential (=field). Also, S must have sufficient kinetic 
energy to excite the detector. On the other hand, an indirect registration on S 
is only possible if S is sufficiently isolated so that the auxiliary microsystem must 
interact with S rather than with any other microsystem. 

It follows: For S to have any observable, it must be prepared 

• in a domain D that separates S from other microsystems not necessarily of 
the same type 

• so that either S is available to manipulation by fields and screens^] and has 
sufficient kinetic energy to excite a detector, in the case of a direct registration, 

5 A screen can be theoretically described as a boundary condition that absorbs everything and 
releases nothing. 
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• or S is the only microsystem that can interact with the auxiliary microsystem 
in the case of an indirect one. 

Definition 16 and Assumption 6 imply that both preparations and registrations 
of microsystem S are processes during which the separation status of S changes. 
In particular, in order that observable E of any microsystem S be registered, S 
must enter the sensitive matter part M. of the corresponding detector. hA contains 
subsystems of the same type as those that are also contained in S, either because 
they were there already before the detection started or because M. becomes polluted 
by them during the experiment. Moreover, Ai hinders individual manipulation by 
fields and screens that would be necessary for each given observable of S to be 
registered. 

Assumption 6 also leads to restrictions on observables of macroscopic systems. 
In general, a macroscopic system A is a composite quantum system with very many 
different microsystem constituents. One can subdivide these microsystems into type 
classes. Consider first observables that concern properties of microscopic subsystem 
iS of type r. If we apply the basic rules of observable construction for systems of 
identical microsystems (Sec. 2.2.1), then e.g. the position and momentum of any 
individual microsystem S are not observables of A but 'collective' operators such as 



where a(xk', x' k ) is the kernel of an operator in Q-representation acting on fc-th sub- 
system of type r, could be. Suppose that there is an apparatus B suitable to measure 
a(xk] x' k ) on an individually prepared system S. One can imagine that applying B to 
A in some way similar to that described in Sec. 2.2.2 would measure a co i because all 
subsystems of type S would automatically contribute to the result of the measure- 
ment. However, in Sec. 2.2.2, the registration apparatus was applied to individually 
prepared subsystems. It follows from Assumption 6 that the apparatus B cannot 
be applied to A in this way because none of the identical subsystems of type r is 
prepared individually. Readings of B are signals of its detector and for any detector 
to register S, S must be isolated to be manipulable, have sufficient kinetic energy, 
etc. Hence, to measure collective observable (16. lip , we need a method that makes 
measurements directly on A. 

This holds for direct registrations. As an example of an indirect one consider 
scattering X-rays off a crystal A. By it, relative positions of its nuclei can be 
recognised (an indirect registration). But rather than a position of an individual 
nucleus it is a space dependence of the average nuclear density due to all nuclei. 
Such an average nuclear density could be obtained with the help of an operator 
of the symmetrised form (16. lip . In general, scattering of a microsystem S' off 




(6.11) 



k 
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a macrosystem A can be determined in terms of a potential Vk(x, £, Xk, £&) that 
describes the interaction between S' and one of the microscopic subsystems of A. 
The whole interaction Hamiltonian is then the sum analogous to (16. lip . 



which however extends over all microscopic subsystems of A that can interact with 
S'. It is also important to realise that there are very few interactions that can supply 
potentials useful for practical experiment. 

Other examples have to do with additive quantities, such as momentum and 
angular momentum. Total values of these quantities can be measured and they are 
of the form ( 16. lip , only the sum must extend to all microscopic subsystems of A 
independently of type. 

We notice, first, that an observable ) of a microsystem S can be pro- 

moted to a collective observable ( 16. lip of A if A itself admits a direct or indirect 
registration of it, which happens only in special and seldom cases. Second, such 
a collective observable is still too 'sharp', because only some averages with rather 
large variances can be observed. It is impossible to obtain its single eigenvalues as 
results of registration (for an example, see Ref. [56], p. 181). 

A different case is the kinetic energy of S. Again, it cannot be measured by the 
method kinetic energy is measured on individual systems of type S. In the special 
case that A is in thermodynamic equilibrium, the average of the kinetic energy of 
S would have the meaning of 1/ks times the temperature of A. Hence, a viable 
method to measure the average is to measure the temperature of A. Again, this is 
a very special case that works only under specific conditions. 

Thus, one of the consequences of Assumption 6 are principal and severe limits on 
promotion of POV measures to observables. 

6.3 A simple modification of BCL model 

Sec. 6.2 motivated the first idea relevant for the present section, namely that the 
reading of a registration apparatus for microsystems is in fact an electronic sig- 
nal from a detector. This gives us much clearer notion of registration apparatus. 
The second relevant idea comes from Sec. 2.2.4, namely that fixed-status quan- 
tum mechanics (FSQM) has its limits. This consequence of general rules of stan- 
dard quantum mechanics about indistinguishable microsystems leads to a significant 
modification of quantum theory of measurement. The necessary changes are: 

1. Each preparation of microsystem S must give S a non-trivial separation status 




A; 
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D, D 7^ 0. Thus, a quantum object comes into being only in preparatio: 



2. Object S must be prepared so that it can then be manipulated and controlled 
by devices within D such as electric and magnetic fields. This can be viewed 
as additional condition on separation status, see Ref. [32] . 

3. Macrosystem A such as a blocking screen, a scattering target or a detector 
that contains microsystems indistinguishable from S or from some subsystems 
of S must lie behind the boundary of D. Other microsystems of the same 
type as S that may be in D due, e.g., to imperfect vacuum must not disturb 
registration of S. Corrections to FSQM description of the behaviour of the 
composed system S + A due to a necassary separation-status change of S must 
be carefully chosen. 

The usual method of FSQM is to specify initial states of both S and A before 
their interaction, choose some appropriate interaction Hamiltonian and calculate 
the corresponding unitary evolution of the composed system S + A ignoring the 
problem with separation-status change. As shown in Sec. 6.1, the results are wrong 
for registration apparatuses. The purpose of this section is to make some corrections. 
First, we must modify and extend BCL model. 

Let system S of type r be prepared as a quantum object of separation status D 
in a domain D. Let be the observable to be registered with eigenvalues Ok and 
eigenvectors <fiki as in Sec. 6.1. Let S be prepared in state and manipulated so 
that its component eigenstates of become spatially separated. 

Next, we construct a sufficiently detailed model of detector A. Let A be an array 
of TV ideal monoatomic-gas ionisation detectors Ak similar to that of Sec. 6.2. Let 
index k enumerate the detectors and let the sensitive matter of each detector have 
separation status where fl D\ = for all k ^ I and fl D = for all k. Let 
the sensitive matter be a system of identical atoms. Let each atom be modelled by 
a particle with mass fi, spin zero and a further degree of freedom, ionisation, with 
two values, 'non-ionised' and 'ionised'. We simplify the model further by assuming 
that the ionisation and translation degrees of freedom define two different formal 
subsystems, A lon and A tT a, which can be decomposed in subsystems A^ on and A% Ta of 
Ak- Let Xkn be the state describing n ionised atoms in kth detector. The states 

JJ ®Xkn{k) 
k 

for all n(fc)'s form a basis of the Hilbert space of Ai on , where n(k) is a map from 
{1, ■ ■ • ,N} into non- negative integers. Let us assume that the initial state of A 10 n 

6 Thcrc is no problem for realism here. Even in classical physics, we often experiment with 
objects that are set up in the laboratory and need not exist before the experiment. 
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(6.12) 



k 



the perfectly non-ionised state. We can further assume that the initial state T tra 
of Atra, is close to a state of maximum entropy with sufficiently low temperature so 
that ionisations due to atomic collisions have a very low probability. 

Let each Ak further contains family S' k of systems of type r identical with S and 
let the number of such subsystems in S' k be M k . Thus, Ak = A^ on + *4| ra + S k Let 
the subsystems of S' k be distributed over the sensitive matter of the fc-th detector 
so that the separation status of S' k is Dk- We assume that the subsystems of S' k do 
not interact with each other, with the rest of the detector and with S. Let the state 
of S' k be Tfe. Hence, T fc satisfies 



where P T is the projection to the symmetric or antisymmetric subspace of H^K, 
depending on r, see Sec. 2.2.1. 

Let the measurement coupling be a coupling between S and the ionisation degree 
of freedom of each atom in the sensitive matter of the whole array. That is, S 
interacts directly only with A\ oa . In a single detector, after the ionisation of the 
first atom, all subsequent ionisations lie along a ray track inside the same detector. 
An explanation of the fact that e.g. a spherical wave can produce a straight track 
is given in Ref. p>3], where it is shown that the position of the track head, the first 
ionisation of the track, determines the track. This can be considered as a necessary 
property of every measurement coupling that is possible in the case considered here. 

Let our first attempt at the new rule be based on the assumption that the dynam- 
ical rule of the standard quantum mechanics holds. Under this assumption, let the 
measurement coupling fulfils the conditions of the BCL model, so that, if systems 
S' k did not exist, we would be lead to Eq. (16. 2p . where if) is given by Eq. (I6.12p . 



and a n are coefficients independent of k satisfying J2 n l a «| 2 = !• This * s again 
a simplifying assumption: each S creates always the same ionisation state in any 
detector. Then Eqs. (16.31) and (16.41) are valid. Now, we can define the observable 
A by its eigenstates (I6.12p with eigenvalue and ^'s with eigenvalues Ok assuming 
Ok 7^ for all k. 

In Sec. 6.1, states ipk were called 'end states' of A and A was called 'pointer ob- 
servable'. Here, we prefer to call A trigger observable and ipk trigger states because 



T, - p( M k)-r, p( M k) 

' k — ' T ' k' T j 
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there is a further evolution of A independent of S that leads from ip k to the concen- 
tration of charges at the electrodes and an electronic signal of k-th detector. This 
is due to a coupling between A{ on and At Ta , mediated by the electrostatic field of the 
electrodes: ionised atoms move in a different way than the non-ionised ones. This 
motion leads to atom collisions and further ionisation in a complicated irreversible 
process. Only then, the true end states with true pointer values are achieved. The 
pointer values are some averages with some variances, in agreement with the ex- 
pectation of Refs. j29j SO]- However, what is important for us happened already at 
the trigger stage and we can ignore the evolution from a trigger state to a detector 
signal. 

Let further the support of trigger states (p k [ of S be D k for all k, I. It follows then 
that, 

(<Pkl\<Pmn) = SkmSln ■ (6.13) 

Thus, we assume a special case of the BCL dynamics. Then, that Eq. ( 16. 6 j) would 
hold. 

The existence of systems S' k leads however to change of the separation status of 
S after S enters the sensitive matter of a detector. We can take this into account 
similarly as in Sec. 2.2.4 and replace state $ cn( j of system S + A ion by a suitably 
entangled state T trig of system S+J2k Aon- However, many choices are possible, 
for example: 

T trisl = VP^Pi ^ 2 P^ Mk \H)(^\ ® Ti ® • • • ® T^Pf+E.**) ® |^X^I , 

kl 

where v is a suitable normalisation factor (projections do not preserve norm). State 
Ttrigi entails a maximal entanglement of identical systems within S + Ylk^'k- This 
is not in agreement with Rule 14 because for each k, the separation status of system 
S' k is D k as is the support of the state \<p\). 

An attempt that entails a minimal entanglement of identical systems that is still 
compatible with our ideas on separation status is: 

T t ri g 2 = 22 VPkPi v k vi\N ki <g> Ti <g> ■ ■ ■ <g> T fc _i <g) T fe+ i®> 

kl 

■ ■ ■ ® T z _i ® Ti +1 ® • • • ® J N ® \ip k )(H ■ (6.14) 

The form of the summands is valid just for non-diagonal terms k < I, but the sum 
must be understood as consisting of all non-diagonal terms, k ^ I, with operators 

W fci = P^ +1 )(T fc ®|^))((^|®T,)P^ +1 ) 

on the extended Hilbert space H* +Mfc+Mi of system S + S' k + S' h as well as the 
diagonal ones with the operator 

W fefc = P^ +1 )(|^)(^|®T fe )P^ +1 ) 
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on the Hilbert space of system S + S' k . Normalisation factor is k is determined by the 
condition 

V k = — ^=^= . 

y/tr[W kk ] 

Then, T trig2 satisfies the normalisation condition of a state. Indeed, the trace of 
a tensor product of operators is a product of the traces of the operator factors, 
see Proposition 24. As tr[\ip k )(ipi\] = 5m, only the diagonal elements of the sum 
contribute. We have 

Proposition 22 The trace of operator \N k i for k ^ I is zero. 

For proof, see next section. 

Proposition 22 implies immediately that the trigger state of A lon) obtained from 
Eq. (I6.14p and (|6.13|) . is given by Eq. f 16 . 6 j) . However, the right-hand side of Eq. 
(16. 6 p is not the gemenge structure of the state because state T tr i g 2 does not satisfy 
the conditions of Rule 11. Hence, our first choice of dynamics is not supported by 
experimental evidence. 

Another attempt is to take just the diagonal elements of T trig2 : 

T t ri g 3 = ^Pk^i <8> • • • ® T fc _i <g> W fcfc <g) J k+1 g) • • • <g> Tiv <8> \^pk)(ipk\ (6.15) 
k 

State T tr ig2 of S + Aon contains many correlations between systems Ai on and 
S + Y2k^'k- The only correlations that remain in T trig3 are, for each k, those between 
the trigger states ipk of Aon an d states ^|W fcfc of system S + S' k . 

States Tt r i g i, T tr i g 2 or T tr i g 3 describe S being inside A and the registration is 
running. A part of this registration is the registration of A. Hence the observables 
that can be measured at this stage of the experiment must commute with 1 eg) A to 
be jointly measurable with it. However, we have: 

Proposition 23 Let B be a sharp observable of system of system S + Y2k + Ai on 
that commutes with ^-s+Y. k s k ® A. Then 

tr[BT trig2 ] = tr[BT fri53 ] . 

The proof of Proposition 23 will be given in the next section. Proposition 23 means 
that the states T trig2 and T trig3 cannot be distinguished from each other at this stage 
of the experiment. If the detector is non-absorbing, and system S is released from 
it, then the existing experimental evidence supports the notion that the state after 
the releasing is T trig3 with the gemenge structure given by the right-hand side of Eq. 
(I6.15P rather than T trig2 . 

This motivates the following assumption: 
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Rule 17 Let a microsystem S be detected by a detector Ak and let the measurement 
coupling modified as above satisfy Eq. A6.13\) . Then the state of S + J2k^'k + Aion is 



T 



trig 



^2 PfcdV'ioXV'iol <8> Ti) ® • • • <g> (|-0fe_io> <-0fe-io| ® T fc _i) 



k 



n / \ n 




<gi (|'0fe+io)('0fc+io| ® T fc+ i) (8) • • • <S> {\i>m)(i>No\ <8> Tjv) • (6.16) 



State Tt r ig satisfies two conditions: 

1. The trigger state of ^4 ion has gemenge structure f)6.6p . That T trig satisfies this 
condition follows from Rule 11. 

2. It contains the same measurable correlation between systems A lon and S + S' k 
as T tI . ig2 . That T trig satisfies this condition follows from Proposition 23. 

All other correlations are erased during the change of separation status of S + 
A. What survives and what is erased is uniquely determined in this case by the 
modified BCL model and Rule 17. In particular, the probability reproducibility 
condition determines states (pki from the initial state ip of Aon uniquely and the 
initial state <f> of S determines states ip\ uniquely. Thus, the evolution from state 
\4>)(4>\ ® Ti <S> ■ ■ ■ ® T^r) (g) IV'oXV'oI t° state T trig is non-unitary but still deterministic. 
Rule 17 is a new general rule which has to be added to quantum mechanics. To 
choose such a rule, we have looked at observations and experiments. 

The correct interpretation of Rule 17 distinguishes two cases. If the detectors are 
absorbing, then state T trig evolves with S staying inside A and being lost always in 
one of the detectors. If they are non-absorbing, then state T trig evolves to 



where T' fc are the end states of Ak including the detector response. In this case, 
states ^fWfcfc evolved to the release of S in state tp\ that is correlated with detector 
signals. Each release is understood as an instance of preparation and the whole 
procedure is a random mixture of these single preparations. In both cases, the end 
state of Aion has then gemenge structure (16. 6p . 

As an example of a system of non-absorbing detectors, the MWPC telescope for 
particle tracking can be mentioned jl5] . It is a stack of the so-called multiwire pro- 
portional chambers (MWPC), which is arranged so that a particle runs through, 




(6.17) 
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exciting each of them. The resulting system of electronic signals contains the infor- 
mation about the particle track. 

A registration by a non-absorbing detector is similar to a scattering of a microsys- 
tem by a macroscopic target. First, let us consider no-entanglement processes such 
as the scattering of electrons on a crystal of graphite with a resulting interference 
pattern [19J or the splitting of a laser beam by a down-conversion process in a crystal 
of KNb0 3 (see, e.g., Ref. [51]). No-entanglement processes can be described by the 
following model. Let the initial state of the target A be T and that of the microsys- 
tem be 0. We assume that the end state of the target is T' and the end-state of the 
microsystem is ip and that we have a unitary evolution: 



The two systems are not entangled by their interaction, hence there is no necessity 
to divide the resulting correlations between S and A in what survives and what is 
erased. The end state is already of the form (16. 17j) and it has a trivial gemenge 
structure. In this way, our corrections to FSQM become trivial. 

A more complicated case is an entanglement scattering. Let microsystem S in 
initial state be scattered by a macrosystem A in initial state T and let this lead 
to excitation of different microscopic subsystems of A. Scattering of neutrons on 
spin waves in ferromagnets, transmutation of nuclei inside A or, for that matter, 
ionising an atom in a gas detector are examples. We have, therefore, a more general 
situation than that in which Rule 17 gives a unique result. It seems that the change 
of status must lead to survival of some correlations between S and A while some 
are erased. However, in this situation it has still to be investigated which is which. 
Clearly, the definitive general rule will depend on the two interacting systems and on 
the interaction Hamiltonian. More theoretical and experimental work is necessary 
to guess the general rule. 

One can wonder whether a more detailed quantum mechanical model can be 
constructed of what happens during a change of separation status. The reason why 
this cannot be done within FSQM is that FSQM is not applicable to changes of 
separation status. Hence, just a new law added to FSQM is needed. 

6.3.1 Proofs of Propositions 22 and 23 

For the proofs, we shall need some simple properties of trace. 

Proposition 24 1. Let operator B has a form B = Ylmn Bmn\ m )( n \j where {\n)} 
is some family of states. Then, 



|0X0| ®T^)M®T\ 




mn 
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2. 



Let operator B on Hilbert space H leave a subspace Hi G H invariant and acts 
trivially on H^. Then 

tr u [B] = tr Hl [B] . 



3. Let three trace-class operator B, Bi and B 2 satisfy B = Bi <g> B 2 . Then 



tr[B] = tr[B 1 ]tr[B 2 ] ■ 



Proof of Proposition 22 Our method is to expand operator \N k i in a basis and to 
use Proposition 24. The basis can be chosen as follows. In our model, states tp\ and 
T k have supports inside D k . Hence, they can be expanded into a basis {0^},m = 
1, 2, • • • of Ho fc . We can choose (pf = ip\. Then, the family of vectors {<f) m } U {0n} 
is a basis for the extended Hilbert space (H r ) 1+Mfc+M * of system S + S' k + <S/. Any 
term in expansion of operator \N k[ into basis {(fi m } U {0n} wm contain composite 
vectors 



Perm[l + M fc ](|#) <g> ® • • • <g> |^ (Mfc) ) <g> Perm[M,](|^ (1) ) ® • • • ® |^ (M() )) 



Perm[M fc ]«# (1) | ® • • • ® (0 r fc (Mfc ))l ® Perm[l + M,]«#| ® <# (1) | ® • • • <g) 



right, where Perm[iV] is an arbitrary but fixed order permutation of iV states and 
the state choices m(-), n(-), r(-) and s(-) are mappings from the first or 
positive integers to positive integers (they need not be injections). Clearly, the two 
composite vectors are orthogonal for any permutations and state choices because 
there is one more element of {0^} in the product defining the first one than in the 
second one, and one more element of {<f> l m } in the product defining the second one 
than in the first one. Hence, the trace is zero, Q.E.D. 

Proof of Proposition 23 According to Sec. 6.1, {ip k } together with if) is an or- 
thonormal basis in a Hilbert space on which A is an observable, 



and all o k are different from each other. 

Let {$fe} be a basis for the Hilbert space of system S + ^2 k S' k . Then, any 
observable B can be written as 



left and 




k 



B=Y1 B kl m n\$k)($ m \ ® 



klmn 
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In order to commute with 1 <g> A, B has to be diagonal in / and n: 

B = Y^ B klm\®k)(®m\ ® • 

klm 

Then, we easily obtain 

tr[T tri g 2 B] = B krn ip m v 2 m {^ l \J 1 <g> • • • <g) T m _i <g> W mm <g> T m+i <g> • • • <g> Tjv|$,) 

kml 

and the same result for ir[T tr ig30'], which proves Proposition 23. 
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Chapter 7 
Conclusion 



Our careful study of quantum-mechanical practice, both experimental and theoret- 
ical, has lead us to quite a new picture of quantum mechanics. It differs from most 
current approaches in several points. 

First, we do not consider a value of an observable of system 5 as a property 
of S but only as an indirect piece of information on such properties. Each value 
of observable is determined by both objective properties of S and the registration 
apparatus. Thus, we reject attempts to view all observables of S, or at least some 
subsets of them, as objective properties of S. 

Second, we find a qualitative difference between quantum microsystems and large 
composite systems. In nature, microsystems occur in families of a fixed type, the 
elements of which are identical, and do not possess any individuality in strict sense. 
Sufficiently large composite systems are different for the following reason. The 
larger a composite system is, the less probable the existence of an identical system 
somewhere in the universe is. 

Third, classical properties are objective quantum properties of macrosystems as- 
sociated with states that are close to maximum entropy. We have shown how this 
principle is applied to classical mechanics by introducing a new class of states, the 
maximum-entropy packets. 

Fourth, we give preparation a different and much greater significance than is 
usually assumed. In a preparation procedure, the prepared system gains its objective 
quantum properties such as states, gemenge structures, averages and variances of 
observables etc. so that it is justified to speak of a physical object. This is what 
we call quantum object. Next, a preparation must separate a microsystem from 
the set of identical microsystems, at least approximately so that it obtains a non- 
trivial separation status. Only then, it can be viewed as an individual system 
and a particular set of observables depending on the status becomes measurable 
on it. In this way, the standard rule for composition of identical microsystems 
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can be weakened. This is justified by the idea of cluster separability. Finally, a 
preparation also separates the microsystem from all othe microsystems so that it 
can be individually manipulated by external fields or matter shields and registered 
by detectors. 

Fifth, macrosystems, as well as generally large composite systems, have much less 
observables than one would expect according to standard quantum mechanics. There 
are two reasons why only very few observables concerning single constituents of such 
a system can be measured. On the one hand, the constituents may be elements of a 
large family of identical systems from which they are not separated by preparation 
and do not, therefore, possess any measurable observables of their own. On the other, 
they are not separated from other microsystems to be individually manipulable by 
fields and shields and registrable by detectors. The differences between macroscopic 
and microscopic systems that are the contents of the second, third and fifth points 
are not due to inapplicability of quantum mechanics to macroscopic systems. Just 
the opposite is true: they result from strict and careful application of standard 
quantum mechanics to macroscopic systems. 

Sixth, we give registration a more specific form than is usually assumed. Any 
apparatus that is to register a microsystem directly contains a detector and the 
'pointer' value that is read off the apparatus is a signal from the detector. We 
assume that each detector contains a macroscopic piece of sensitive matter with 
which the registered microsystem becomes unified and looses its object status. The 
post-Everett trend in the theory of quantum measurement is to avoid collapse of the 
wave function. In a sense, the present paper is heading in the opposite direction. We 
even replace the collapse by a more radical transformation, a change in microsystem 
kinematic description. This change is, in plain words, a kind of disappearance of a 
registered object during its registration. Nevertheless, our result for non-absorbing 
detectors and the collapse idea by von Neumann have some features in common. 

Finally, standard quantum mechanics does not provide information about pro- 
cesses, in which the separation status of microsystems changes. Preparations and 
registrations belong to such processes. Unjustified application of standard rules to 
such processes leads to contradictions with experimental evidence. However, one can 
add new rules to quantum mechanics governing such processes without violating its 
logic. 

The main purpose of Chap. 5 is to show how the new ideas on measurement 
problem work by studying well-understood, restricted class of physical conditions 
in which the following assumptions are a good approximation: (a) non-relativistic 
quantum mechanics, (b) measurement performed directly on microsystems, (c) a 
simple modification of BCL model of measurement and (d) pointer readings being 
signals from detectors. 
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In our future work, we have gradually to remove the restrictions. First, we must 
turn to other models of measurement, for example to different (non-ideal) kinds 
of detectors or to the more realistic unitary premeasurements for which the no- 
go theorems such as Theorem 6.2.1 in Ref. [16J, p. 76, hold. The main point is 
again that the state resulting from the unitary evolution contains information about 
properties of the composite system S + A that could be measured only if more 
observables than of S existed. Thus, a change of this illusory state analogous to 
that given by Rule 17 could be justified, it could again remove the objectification 
problem and the no-go theorems could be avoided. The exact division line between 
correlations that survive and those that are erased might again be determined by a 
careful analysis of observational facts. 

Next, relativistic corrections have been neglected so that all notions and rules 
of non-relativistic quantum mechanics could be used. An extension of the present 
results to relativistic fields seems to be a realistic project. 

Let us finish by giving a summary of how our theory changes quantum mechanics. 
Standard quantum mechanics has the so-called minimal interpretation (see, e.g., Ref. 
[T6] . p. 9). As measurement is concerned, it just postulates the probability repro- 
ducibility and objectification avoiding any deeper theory. It does not worry about 
origin of classical properties and about philosophy. Moreover, common practice of 
quantum mechanics does not take the symmetrisation or anti-symmetrisation over 
all identical systems seriously ignoring all unknown identical systems. This Minimal 
Quantum Mechanics has been in general use and it has been formidably successful. 
Our theory takes over Minimal Quantum Mechanics just giving a new foundation 
to the identical systems that justifies the common practice. To this, it adds a quan- 
tum theory of measurement that is motivated by the new ideas on identical systems 
and satisfies probability reproducibility and objectification requirements. Finally, it 
adds a realist interpretation and a theory of classical systems that do not contradict 
Minimal Quantum Mechanics. 

Acknowledgements 

The authors are indebted to Stefan Janos for help with experimental physics and to 
Heinrich Leutwyler for useful discussions. J. T. acknowledges partial support of the 
Ministry of Education of Czech Republic, research projects MSM6840770039 and 
LC06002. 



126 



Bibliography 



[1] E. Arthurs and J. L. Kelly, Jr., Bell System Tech. J. 44 (1965) 725. 

[2] H. Araki and E. H. Lieb, Commun. Math. Phys. 18 (1970) 160. 

[3] L. E. Ballentine, Quantum Mechanics. A Modern Development, World Scien- 
tific, Singapore, 2008. 

[4] J. S. Bell, On Einstein-Podolsky-Rosen paradox, Physics bf 1 (19964) 195. 

[5] J. S. Bell, Against 'measurement' in Sixty Two Years of Uncertainty, A. I. 
Miller (Ed.), Plenum, New York, 1990. 

[6] J. S. Bell, Helv. Phys. Acta, 48 (1975) 93. 

[7] J. S. Bell, On the problem of hidden variables in quantum mechanics, Rev. 
Mod. Phys. 38 (1966) 447. 

[8] E. G. Beltrametti, G. Cassinelli and P. J. Lahti, J. Math. Phys. 31 (1990) 91. 

[9] G. Birkhoff and J. von Neumann, Ann. of Math. 37 (1936) 823. 
[10] D. Bohm, Quantum Theory, Prentice-Hall, Englewood Cliffs, 1951. 
[11] P. Bona, Acta Phys. Slav., 23 (1973) 149, 25 (1975) 3, 27 (1977) 101. 
[12] M. Born, Phys. Blatter 11 (1955) 49. 

[13] V. B. Braginsky and F. Ya. Khalili Rev. Mod. Phys. 68 1 (1996). 

[14] C. Brukner and M. Zukowski, Bell's Inequalities: Foundation and Quantum 
Communication, arXiv:0909.2611, 

[15] J. Bub, Interpreting the Quantum World, Cambridge University Press, Cam- 
bridge, UK, 1999. 

[16] P. Busch, P. J. Lahti and P. Mittelstaedt, The Quantum Theory of Measure- 
ment, Springer, Heidelberg, 1996. 

127 



[17] C. Chevalley, in Niels Bohr and Contemporary Philosophy, ed. by J. Faye and 
H. Holse, Kluwer, Dordrecht, 1994. 

[18] F. Coester, Int. J. Modern Phys. 17 (2003) 5328. 

[19] C. Davisson and L. Germer, Phys. Rev. 30 (1927) 705. 

[20] H. D. Doebner, P. Stovicek and J. Tolar, Rev. Math. Phys. 13 (2001) 799. 705. 

[21] F. Exner, Vorlesungen iiber die physikalischen Grundlagen der Naturwis- 
senschaften, Deuticke, Leipzig, 1922. 

[22] B. d'Espagnat, Veiled Reality, Addison- Wesley, Reading, 1995. 

[23] J. Gemmer, M. Michel and G. Mahler, Quantum Thermodynamics. Emergence 
of Thermodynamic Behavior Within Composite Quantum Systems, LNP 657, 
Springer, Berlin, 2004. 

[24] A. Bassi and G. Ghirardi, Phys. Letters A 275 373 (2000). 

[25] A. M. Gleason, J. Math. Mech. 6 (1957) 885. 

[26] S. Goldstein, J. L. Lebowitz, R. Tumulka and N. Zanghi, Phys. Rev. Letters 
96 (2006) 050403. 

[27] D. M. Greenberger, M. A. Home, A. Shimony, and A. Zeilinger, Bell's theorem 
without inequalities, Am. J. Phys. 58 (1990) 1131. 

[28] R. Haag, Local Quantum Physics. Fields, Particles, Algebras, Springer, Berlin 
1992. 

[29] P. Hajicek and J. Tolar, Found. Phys. 39 (2009) 411. 
[30] P. Hajicek, Found. Phys. 39 (2009) 1072. 

[31] P. Hajicek, Quantum measurement problem and cluster separability, 
larXiv: 1001 .18271 To be published in Found. Phys., DOI: 10.1007/sl0701-010- 
9506-3. 

[32] P. Hajicek, A new approach to quantum measurement problem: cluster sepa- 
rability, larXiv:1011.1732l 

[33] L. Hardy, Quantum mechanics, local realistic theories, and Lorentz invariant 
realistic theories, Phys. Rev. Letters 68 (1992) 2981. 

[34] K. Hepp, Helvetica Phys. Acta, 45 (1972) 237. 



128 



[35] C. J. Isham, Lectures on Quantum Theory. Mathematical and Structural Foun- 
dations, Imperial College Press, London 1995. 

[36] E. T. Jaynes, Probability Theory. The Logic of Science, Cambridge University 
Press, Cambridge (UK) 2003. 

[37] B. Kayser and L. Stodolsky, Phys. Lett. B 395 (1995) 343. 

[38] B. D. Keister and W. N. Polyzou, in Advances in Nuclear Physics, ed. J. W. 
Negele and E. Vogt, Plenum, New York 2002. Vol 20. 

[39] C. Kittel, Introduction to Solid State Physics, Wiley, New York 1976. 

[40] S. Kochen and E. P. Specker, J. Math. Mech. 17 (1967) 59. 

[41] J. Kofler and C. Brukner, Phys. Rev. Lett. 99 (2007) 180403. 

[42] K. Kraus, States, Effects and Operations, Springer, Berlin, 1983; LNP 190. 

[43] W: J. Mullin and F. Laloe, Classical phases and quantum angles in the descrip- 
tion of interfering Bose-Einstein condensates, larXiv:0 908.3012, 

[44] A. J. Leggett, J. Phys.: Condens. Matter 14 (2002) R415. 

[45] W. R. Leo, Techniques for Nuclear and Particle Physics Experiments, Springer, 
Berlin, 1987. 

[46] J. M. Levy-Leblond, in Group Theory and Its Applications, Vol. 2, ed. E. M. 
Loebl, Academic Press, New York 1941. 

[47] N. Linden, S. Popescu, A. J. Short and A. Winter, larXiv:0812.2385l 

[48] G. Ludwig, Foundations of Quantum Mechanics I, Springer, New York, 1983; 
Foundations of Quantum Mechanics II, Springer, New York, 1985. 

[49] G. Ludwig, An Axiomatic Basis for Quantum Mechanics 1, Springer, Berlin, 
1985; An Axiomatic Basis for Quantum Mechanics 2, Springer, Berlin, 1987. 

[50] C. W. Misner, K. S. Thorn and J. A. Wheeler, Gravitation, Freeman, San 
Francisco, 1973. 

[51] L. Mandel and E. Wolf, Optical Coherence and Quantum Optics, Cambridge 
University Press, Cambridge, 1995. 

[52] Methods of Experimental Physics, Vol. 4, Part A, ed. by V. W. Hughes and H. 
L. Schultz, Academic Press, New York, 1967. 



129 



[53] N. F. Mott, Proc. Roy. Soc. London Series A 126 (1929) 79. 

[54] J. von Neumann, Mathematical Foundation of Quantum Mechanics, Princeton 
University Press, Princeton NJ, 1955. 

[55] W. Pauli, Die allgemeinen Prinzipien der Wellenmechanik, Springer, Berlin, 
1990. 

[56] A. Peres, Quantum Theory: Concepts and Methods, Kluwer, Dordrecht, 1995. 

[57] J. R. Pierce, An Introduction to Information Theory. Symbols, Signals and 
Noise, Dover, New York, 1980. 

[58] C. Piron, Foundations of Quantum Physics, Benjamin, Reading 1976; C. Piron, 
Found. Phys. 2 (1972) 287. 

[59] K. R. Popper, The Logic of Scientific Discovery, Hutchinson, London 1959. 

[60] D. Poulin, Phys. Rev. A 71 (2005) 022102. 

[61] S. Popescu, A. J. Short and A. Winter, [arXiv:quant-ph/0511225| (2006). 

[62] H. Primas, Chemistry, Quantum Mechanics and Reductionism. Springer, 
Berlin, 1983. 

[63] M. Reed and B. Simon, Methods of Modern Mathematical Physics, Vol. I, Aca- 
demic Press, New York, 1972. 

[64] D. E. Rutherford, Proc. Roy. Soc. (Edinburgh), Ser. A, 62 (1947), 229; 63 
(1951), 232. 

[65] G. L. Sewell, Quantum Mechanics and its Emergent Macrophysics, Princeton 
University Press, Princeton, 2002. 

[66] C. E. Shannon, Bell Syst. Tech. J. 27 (1948) 379, 623. 

[67] A. Tonomura, Am. J. Phys. 57.(1989) 117. 

[68] D. Twerenbold, Rep. Progr. Phys. 59 (1996) 239. 

[69] W. Thirring, Lehrbuch der Mathematischen Physik, Springer, Berlin, 1980. 

[70] D. Wallace, The quantum measurement problem: state of play, 
larXiv:0712.0T49l l [quant-ph]. 

[71] A. Wehrl, Rev. Mod. Phys. 50 (1978) 221. 



130 



[72] S. Weinberg, The Quantum Theory of Fields Vol. I, Chap. 4. Cambridge Uni- 
versity Press, Cambridge 1995. 

[73] E. P. Wigner, Group Theory, Academic Press, New York, 1959. 

[74] F. Wilczek, Phys. Rev. Letters 49 (1982) 957. 

[75] D. Giulini, E. Joos, C. Kiefer, J. Kupsch, I.-O. Stamatescu, H. D. Zeh, Deco- 
herence and the Appearance of Classical World in Quantum Theory, Springer, 
Berlin, 1996. 

[76] W. H. Zurek, Rev. Mod. Phys., 75 (2003) 715. 



131 



